ALIGNMENTS 



RESULT 1 
AAU74904 

ID AAU74904 standard; protein; 317 AA. 
XX 

AC AAU74904; 
XX 

DT 09-APR-2002 (first entry) 
XX 

DE Amino acid sequence of mouse G-protein coupled receptor TGR18 protein. 
XX 

KW Mouse; G-protein coupled; receptor; GPCR; TGR18; kidney disease; 

KW signal transduction modulator; cerebral cavernous malformation; 

KW hyperlipidemia; obesity; dyslexia; cardiac myxoma; renal failure; 

KW nephritis; hypertension; liver disease; cirrhosis; blood disorder; 

KW spleen-associated disorder; immune disorder. 
XX 



OS Mus sp. 
XX 

PN WO200200719-A2. 
XX 

PD 03-JAN-2002. 
XX 

PF 25-JUN-2001; 2001WO-US020363. 
XX 

PR 23-JUN-2000; 2000US-0213461P. 
XX 

PA (TULA-) TULARIK INC. 
XX 

PI Lin DC, Zhao J, Chen J, Cutler G; 
XX 

DR WPI; 2002-147880/19. 

DR N-PSDB; ABK12957. 
XX 

PT New G-protein coupled receptor polypeptides, useful for identifying 

PT modulators of signal transduction for treating kidney disease, 

PT hyperlipidemia, obesity, dyslexia and cardiac myxoma. 
XX 

PS Claim 33; Page 59; 78pp; English. 
XX 

CC The present invention relates to a new G-protein coupled receptor (GPCR) 

CC polypeptide comprising greater than 70% amino acid sequence identity to 

CC the amino acid sequence of human GPCRs TGR62, TGR21, TGR130.1, TGR130.2, 

CC human TGR213 or TGR92, 80% amino acid sequence identity to mouse TGR18 or 

CC 90% amino acid sequence identity to human novel edg receptor protein, as 

CC defined in the specification. The GPCR covalently linked to a solid phase 

CC is useful for identifying a compound that modulates signal transduction. 

CC The identified compounds are useful for treating kidney disease, cerebral 

CC cavernous malformations, hyperlipidemia, obesity, dyslexia and cardiac 

CC myxoma. The molecules of the invention are useful for diagnosing 

CC disorders or conditions such as kidney-related conditions or diseases 

CC such as renal failure, nephritis, nephrotic syndrome, asymptomatic 

CC urinary abnormalities, renal tubule defects, hypertension and 

CC nephrolithiasis, liver-related disease or condition e.g. cirrhosis, 

CC infiltrations, lesions, functional disorders and jaundice and spleen- 

CC associated disorders or conditions e.g. splenic enlargement, immune 

CC disorders, blood disorders and others. Modulation of the polypeptide of 

CC the invention is useful to treat or prevent any of the above conditions 

CC or diseases. The present amino acid sequence represents the mouse GPCR 

CC TGR18 protein of the invention. This sequence is one of seven novel G 

CC protein coupled receptors of the invention (AAU74904-AAU74911) 

XX 

SQ Sequence 317 AA; 

Query Match 100.0%; Score 1650; DB 5; Length 317; 

Best Local Similarity 100.0%; Pred. No. 4.5e-159; 

Matches 317; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60 
     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60 



Qy 

RESULT 2 
ABP81696 

ID ABP81696 standard; protein; 330 AA. 
XX 

AC ABP81696; 
XX 

DT 04-MAR-2003 (first entry) 
XX 

DE Human purinergic receptor P2U2 protein SEQ ID NO: 567. 
XX 

KW G protein-coupled receptor; GPCR; antigenic peptide; gene therapy; 
KW G protein-coupled receptor modulator; antibody; immune-related disease; 
KW growth-related disease; cell regeneration-related disease; AIDS; cancer; 
KW immunological-related cell proliferative disease; autoimmune disease; 
KW Alzheimer's disease; atherosclerosis; infection; osteoarthritis; allergy; 
KW osteoporosis; cardiomyopathy; inflammation; Crohn's disease; diabetes; 
KW graft versus host disease; Parkinson's disease; multiple sclerosis; pain; 
KW psoriasis; anxiety; depression; schizophrenia; dementia; memory loss; 
KW mental retardation; epilepsy; asthma; tuberculosis; obesity; nausea; 
KW hypertension; hypotension; renal disorder; rheumatoid arthritis; trauma; 
KW ulcer. 
XX 

OS Homo sapiens. 
XX 

PN WO200261087-A2. 
XX 

PD 08-AUG-2002. 
XX 

PF 19-DEC-2001; 2001WO-US050107. 
XX 

PR 19-DEC-2000; 2000US-0257144P. 
XX 

PA (LIFE-) LIFESPAN BIOSCIENCES INC. 
XX 

PI Burmer GC, Roush CL, Brown JP; 
XX 

DR WPI; 2003-046718/04. 
DR N-PSDB; ABZ42542. 



PT New isolated antigenic peptides e.g., for G protein-coupled receptors 

PT (GPCR), useful for diagnosing and designing drugs for treating conditions 

PT in which GPCRs are involved, e.g. AIDS, Alzheimer's disease, cancer or 

PT autoimmune diseases. 

XX 

PS Disclosure; Fig 1; 523pp; English. 
XX 

CC The present invention describes antigenic peptides (I) comprising: (a) 

CC any one of 1601 sequences (see ABP82019 to ABP83619) of 12-24 amino 

CC acids. Also described: (1) an assay for the detection of a particular G 

CC protein-coupled receptor (GPCR) or a candidate polypeptide in a sample; 

CC and (2) an isolated antibody having high specificity and high affinity or 

CC avidity for a particular GPCR. (I) can be used as GPCR modulators and in 

CC gene therapy. The antigenic peptides for GPCRs are useful in detecting an 

CC antibody against a particular GPCR, and in the production of specific 

CC antibodies. The peptides and antibodies are also useful for detecting the 

CC presence or absence of corresponding GPCRs. The antigenic peptides for 

CC GPCRs and antibodies are useful for diagnosing and designing drugs for 

CC treating immune-related diseases, growth-related diseases, cell 

CC regeneration-related disease, immunological-related cell proliferative 

CC diseases, or autoimmune diseases, e.g. AIDS, Alzheimer's disease, 

CC atherosclerosis, bacterial, fungal, protozoan or viral infections, 

CC osteoarthritis, osteoporosis, cancer, cardiomyopathy, chronic and acute 

CC inflammation, allergies, Crohn's disease, diabetes, graft versus host 
CC disease, Parkinson's disease, multiple sclerosis, pain, psoriasis, 
CC anxiety, depression, schizophrenia, dementia, mental retardation, memory 

CC loss, epilepsy, asthma, tuberculosis, obesity, nausea, hypertension, 

CC hypotension, renal disorders, rheumatoid arthritis, trauma, ulcers, or 

CC any other disorder in which GPCRs are involved. The antibodies may be 

CC used in immunoassays and immunodiagnosis. ABZ42523 to ABZ42869 encode 

CC GPCR proteins given in ABP81675 to ABP82018, which are used in the 

CC exemplification of the present invention 
XX 

SQ Sequence 330 AA; 






Query Match 75.1%; Score 1238.5; DB 6; Length 330; 

Best Local Similarity 72.2%; Pred. No. 3.3e-117; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60 

M I =1 = 1(11 M I Mill II III: I : I I | I I : I | : | : II I I I || : I I I I 
Db 1 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 60 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

I I : I I I M I I I I : I I : I I II I I I I I II I I I I I I I I I I I | | | | | | | | | : | | | | : : 

Db 61 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 120 



QY 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

M I I I II I I I I I I I I I I I I | : | | | | | | : | | : | | : | : | | : | | | | : | : | 

Db 121 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 180 

QY 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

1111:111111111 Mill: :|||:|::| I I I t I 1 : | I ||::||MI|:| 

Db 181 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 240 



Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 



M I I I : I ! I : I II I I I II I I I I I | I : I I I I I I | | | I II : I I I I : II I : I : 
Db 241 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 300 

Qy 300 MLISKFRQYFKSLTSF 315 

I I : : : I I I I I I I I 
Db 301 MLMNQLRHNFKSLTSF 316 

RESULT 3 
AAB02842 

ID AAB02842 standard; protein; 334 AA. 
XX 

DE Human G protein coupled receptor hCHN10 protein SEQ ID NO: 38. 
XX 

KW Human; G protein coupled receptor; GPCR; transmembrane receptor; 
KW identification; agonist; screening; therapeutic; pharmaceutical; mutant. 
XX 

OS Homo sapiens. 
XX 

PN WO200022131-A2. 
XX 

PD 20-APR-2000. 
XX 

PF 13-OCT-1999; 99WO-US024065. 
XX 

PR 13-OCT-1998; 98US-00170496. 
PR 12-NOV-1998; 98US-0108029P. 
PR 20-NOV-1998; 98US-0109213P. 
PR 27-NOV-1998; 98US-0110060P. 
PR 16-FEB-1999; 99US-0120416P. 
PR 26-FEB-1999; 99US-0121852P. 
PR 12-MAR-1999; 99US-0123944P. 
PR 12-MAR-1999; 99US-0123945P. 
PR 12-MAR-1999; 99US-0123946P. 
PR 12-MAR-1999; 99US-0123948P. 
PR 12-MAR-1999; 99US-0123949P. 
PR 12-MAR-1999; 99US-0123951P. 
PR 28-MAY-1999; 99US-0136436P. 
PR 28-MAY-1999; 99US-0136437P. 
PR 28-MAY-1999; 99US-0136439P. 
PR 28-MAY-1999; 99US-0137127P. 
PR 28-MAY-1999; 99US-0137131P. 
PR 28-MAY-1999; 99US-0137567P. 
PR 29-JUN-1999; 99US-0141448P. 
PR 27-AUG-1999; 99US-0151114P. 
PR 03-SEP-1999; 99US-0152524P. 
PR 29-SEP-1999; 99US-0156555P. 
PR 29-SEP-1999; 99US-0156633P. 
PR 29-SEP-1999; 99US-0156634P. 
PR 29-SEP-1999; 99US-0156653P. 
PR 01-OCT-1999; 99US-0157280P. 
PR 01-OCT-1999; 99US-0157281P. 
PR 01-OCT-1999; 99US-0157282P. 
PR 01-OCT-1999; 99US-0157293P. 
PR 01-OCT-1999; 99US-0157294P. 
PR 12-OCT-1999; 99US-00416760. 
PR 12-OCT-1999; 99US-00417044. 
XX 

PA (AREN-) ARENA PHARM INC. 
XX 

PI Behan DP, Lehmann-Bruinsma K, Chalmers DT, Chen R, Dang HT; 

PI Gore M, Liaw CW, Lin I, Lowitz K, White C; 

XX 

DR WPI; 2000-317986/27. 

DR N-PSDB; AAA46036. 
XX 

PT Non-endogenous, human G protein-coupled receptors for screening receptor, 
PT inverse or partial agonists useful as therapeutic agents. 
XX 

PS Example 1; Page 117-118; 187pp; English. 
XX 

CC The present invention describes transmembrane receptors, preferably human 

CC G protein coupled receptors (GPCR), for which the endogenous ligand is 

CC unknown (orphan GPCR receptors). More specifically the present invention 

CC relates to non-endogenous, constitutively activated versions of a human 

CC GPCR. These non-endogenous human GPCRs can be useful for the direct 

CC identification of candidate compounds as receptor agonists, inverse 

CC agonists or partial agonists for use as pharmaceutical agents. AAA46017 

CC to AAA46126 and AAB02825 to AAB02859 represent sequences used in the 

CC exemplification of the present invention 
XX 

SQ Sequence 334 AA; 

Query Match 75.1%; Score 1238.5; DB 3; Length 334; 

Best Local Similarity 72.2%; Pred. No. 3.4e-117; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 



Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60 
M 1 =1 = 1111 M 1 1 1 1 1 1 II III: hill 1 1 : 1 1 : 1 : 1 1 I 1 1 1 1 : 1 I I I 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 
M : 1 1 1 1 1 1 II 1 : 1 1 : 1 1 1 I 1 1 1 1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 I : I I I I : : 
Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 
1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 : 1 I I M I : | | : | || 1 : 1 : 1 1 : 1 I I I : 1 : | 
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 
1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 : : II 1 : 1 : : 1 1 I I I I I : | | 1 1 :: I I I I 1 1 : I 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 244 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 
1 M 1 1 : 1 1 1 : 1 I I I M || | IN | | | : | | | | | | | | | I | 1 : || | : : | | : | : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304 

Qy 300 MLISKFRQYFKSLTSF 315 
1 1 = 1 1 1 1 1 1 1 1 
Db 305 MLMNQLRHNFKSLTSF 320 


RESULT 4 
AAY71308 

ID AAY71308 standard; protein; 334 AA. 
XX 

AC AAY71308; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Human orphan G protein-coupled receptor hCHN10. 
XX 

KW Human; orphan G protein-coupled receptor; GPCR; hCHN10; drug screening; 
KW transmembrane receptor; expressed sequence tag; EST; signal cascade. 
XX 

OS Homo sapiens. 
XX 

PN ViUZ U UUol^O o 
XX 

PD 02-JUN-2000. 
XX 

PF io-ocT-iy y y; 99WO-US023687. 
XX 

PR 20-NOV-1998; 98US-0109213P. 
PR 16-FEB-1999; 99US-0120416P. 
PR 26-FEB-1999; 99US-0121852P. 
PR 12-MAR-1999; 99US-0123946P. 
PR 12-MAR-1999; 99US-0123949P. 
PR 28-MAY-1999; 99US-0136436P. 
PR 28-MAY-1999; 99US-0136437P. 
PR 28-MAY-1999; 99US-0136439P. 
PR 28-MAY-1999; 99US-0136567P. 
PR 28-MAY-1999; 99US-0137127P. 
PR 28-MAY-1999; 99US-0137131P. 
PR 29-JUN-1999; 99US-0141448P. 
PR 29-SEP-1999; 99US-0156555P. 
PR 29-SEP-1999; 99US-0156633P. 
PR 29-SEP-1999; 99US-0156634P. 
PR 29-SEP-1999; 99US-0156653P. 
PR 01-OCT-1999; 99US-0157280P. 
PR 01-OCT-1999; 99US-0157281P. 
PR 01-OCT-1999; 99US-0157282P. 
PR 01-OCT-1999; 99US-0157293P. 
PR 01-OCT-1999; 99US-0157294P. 
PR 12-OCT-1999; 99US-00416760. 
PR 12-OCT-1999; 99US-00417044. 
XX 

PA (AREN-) ARENA PHARM INC. 
XX 

PI Chen R, Dang HT, Liaw CW, Lin I; 
XX 

DR WPI; 2000-400068/34. 
DR N-PSDB; AAD01135. 
XX 

PT Novel human orphan G protein-coupled receptors and the encoding cDNAs for 
PT use in the identification of G protein-coupled receptor agonists. 
XX 







PS Claim 70; Page 87-88; 102pp; English. 
XX 

CC The present amino acid sequence is the hCHN10, an endogenous human orphan 

CC G protein-coupled receptor (GPCR), expressed in kidney and thyroid. The 

CC hCHN10 cDNA was identified using the human EST (expressed sequence tag) 

CC 1365839 as a probe. The orphan GPCR of the invention, like all GPCRs has 

CC seven transmembrane alpha helices with an extracellular N-terminus and an 

CC intracellular C-terminus. However, no endogenous ligand has yet been 

CC identified for the proteins of the invention. The orphan GPCRs may be 

CC used in the identification of their endogenous ligands, and to screen 

CC potential GPCR agonists and antagonists for use as pharmaceutical agents. 

CC The proteins may also be used in the study of GPCR-mediated signalling 

CC cascades, and to elucidate their precise role in normal and diseased 

CC human conditions. Nucleic acid encoding human orphan GPCRs may be used 

CC for tissue localisation expression analysis to provide information about 

CC their function in healthy and pathological states 
XX 

SQ Sequence 334 AA; 

Query Match 75.1%; Score 1238.5; DB 3; Length 334; 

Best Local Similarity 72.2%; Pred. No. 3.4e-117; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60 

II I : I : M I I II I I I I I I II III: | : | | | I I : I I : I : I I I II I I : I I I I 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64 

QY 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

N ' I I I II I I I I : I I : I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I : I I M : : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

I I I I M I II II I I I I I I I | I : I I I I I I : | | : | || I : | : ( I : I | | I : | : | 
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

I I I I : I M I I I I I I I I MINIM: : I I I : I : : I I I I II I : I I I I : : || II I I : I 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 244 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

M I I I : I I I : I I II I I II I I I I | | | : I I I I | | | || I M : I I I I : I I I : | : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304 

Qy 300 MLISKFRQYFKSLTSF 315 

I I : : : I I I I I M I 
Db 305 MLMNQLRHNFKSLTSF 320 



RESULT 5 
ABB90381 

ID ABB90381 standard; protein; 334 AA. 
XX 

AC ABB90381; 
XX 

DT 24-MAY-2002 (first entry) 
XX 

DE Human polypeptide SEQ ID NO 2757. 



KW Cytostatic; immunosuppressive; nootropic; neuroprotective; antiviral; 

KW antiallergic; hepatotropic; antidiabetic; antiinflammatory; antiulcer; 

KW vulnerary; anticonvulsant; antibacterial; antifungal; antiparasitic; 

KW cardiant; gene therapy; cancer; immune disorder; cardiovascular disorder; 

KW neurological disease; infection; human; secreted protein. 

XX 

OS Homo sapiens. 
XX 

PN WO200190304-A2. 
XX 

PD 29-NOV-2001. 
XX 

PF 18-MAY-2001; 2001WO-US016450 . 
XX 

PR 19-MAY-2000; 2000US-0205515P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Birse CE, Rosen CA; 
XX 

DR WPI; 2002-122018/16. 

DR N-PSDB; ABL90790. 
XX 

PT Novel 1405 isolated polypeptides, useful for diagnosis, treatment and 

PT prevention of neural, immune system, muscular, reproductive, 

PT gastrointestinal, pulmonary, cardiovascular, renal and proliferative 

PT disorders. 

XX 

PS Claim 11; SEQ ID NO 2757; 2081pp + Sequence Listing; English. 
XX 

CC The invention relates to novel genes (ABL89449-ABL90853) and proteins 

CC (ABB89040-ABB90444) useful for preventing, treating or ameliorating 

CC medical conditions e.g. by protein or gene therapy. The genes are 

CC isolated from a range of human tissues disclosed in the specification. 

CC The nucleic acids, proteins, antibodies and (ant)agonists are useful in 

CC the diagnosis, treatment and prevention of: (a) cancer, e.g. breast and 

CC ovarian cancer and other cancers of the adrenal gland, bone, bone marrow, 

CC breast, gastrointestinal tract, liver, lung, or urogenital; (b) immune 

CC disorders e.g. Addison's disease, allergies, autoimmune haemolytic 

CC anaemia, autoimmune thyroiditis, diabetes mellitus, Crohn's disease, 

CC multiple sclerosis, rheumatoid arthritis and ulcerative colitis; (c) 

CC cardiovascular disorders such as myocardial ischaemias; (d) wound healing; 

CC (e) neurological diseases e.g. cerebral anoxia and epilepsy; and (f) 

CC infectious diseases such as viral, bacterial, fungal and parasitic 

CC infections. Note: The sequence data for this patent did not form part of 

CC the printed specification, but was obtained in electronic format directly 

CC from WIPO at ftp.wipo.int/pub/published_pct_sequences 

XX 

SQ Sequence 334 AA; 

Query Match 75.1%; Score 1238.5; DB 5; Length 334; 

Best Local Similarity 72.2%; Pred. No. 3.4e-117; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60 

II I :|:MM II I Mill I |||: |:||| ||:||:| :||IMM:IIII 



Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64 



QY 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

11 = 11 I I I I I I I : I I : | | | | | | | | | M I I I II i I II I I I II I II I I I : I I | | : : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

QY 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

M I I I I I I I I I I I I I I I I I I : I II I I I : I I : I III : I : | | : | | | | : | : | 
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

QY 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

I I I I : I I I I I I I I I I I I I I I I I I I : : | | | : | : : | I I M I I : M I I : : I I I I I I : I 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 244 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

I I I I I : I I I : I I I I I I II I III I | | : | | | | | | | | | I I | : | | | | : | | | : | : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304 

Qy 300 MLISKFRQYFKSLTSF 315 

I I : :: I II I I I II 
Db 305 MLMNQLRHNFKSLTSF 320 



RESULT 6 
ABU92276 

ID ABU92276 standard; protein; 334 AA. 
XX 

AC ABU92276; 
XX 

DT 16-JUL-2003 (first entry) 
XX 

DE Human G protein-coupled receptor hCHN10. 
XX 

KW Human; receptor; orphan G protein-coupled receptor; GPCR; hARE-3; hARE-4; 
KW hARE-5; hRUP3; hRUP5; hRUP6; hRUP7; hGPCR27; hARE-1; hARE-2; hPPR1; hG2A; 
KW hCHN3; hCHN4; hCHN6; hCHN8; hCHN9; hCHN10; hRUP4; signalling cascade. 
XX 

OS Homo sapiens. 
XX 

PN US2003017528-A1. 
XX 

PD 23-JAN-2003. 
XX 

PF 06-JUN-2001; 2001US-00875076. 
XX 

PR 20-NOV-1998; 98US-0109213P. 
PR 16-FEB-1999; 99US-0120416P. 
PR 26-FEB-1999; 99US-0121852P. 
PR 12-MAR-1999; 99US-0123946P. 
PR 12-MAR-1999; 99US-0123949P. 
PR 28-MAY-1999; 99US-0136436P. 
PR 28-MAY-1999; 99US-0136437P. 
PR 28-MAY-1999; 99US-0136439P. 
PR 28-MAY-1999; 99US-0136567P. 
PR 28-MAY-1999; 99US-0137127P. 
PR 28-MAY-1999; 99US-0137131P. 
PR 29-JUN-1999; 99US-0141448P. 
PR 28-SEP-1999; 99US-0156333P. 
PR 29-SEP-1999; 99US-0156555P. 
PR 29-SEP-1999; 99US-0156634P. 
PR 12-OCT-1999; 99US-00417044. 
XX 

PA (CHEN/) CHEN R. 

PA (DANG/) DANG H T. 

PA (LIAW/) LIAW C W. 

PA (LINI/) LIN I. 
XX 

PI Chen R, Dang HT, Liaw CW, Lin I; 
XX 

DR WPI; 2003-428952/40. 

DR N-PSDB; ACA93273. 
XX 

PT Novel endogenous, orphan, human G protein-coupled receptors useful for 

PT identification of modulators of the receptor and as research tools for 

PT understanding the role of the receptor in human body. 
XX 

PS Claim 70; Page 41-42; 54pp; English. 
XX 

CC The invention relates to a human G protein-coupled receptor (GPCR) 

CC appearing as ABU92259-ABU92277 (encoded by cDNAs ACA93256-ACA93274) named 
CC hARE-3, hARE-4, hARE-5, hRUP3, hRUP5, hRUP6, hRUP7, hGPCR27, hARE-1, 
CC hARE-2, hPPR1, hG2A, hCHN3, hCHN4, hCHN6, hCHN8, hCHN9, hCHN10 and hRUP4. 

CC Also included are a plasmid comprising a vector and one of the cDNAs 

CC above and a host cell comprising the plasmid. The GPCRs are useful for 

CC the direct identification of candidate compounds as inverse agonists, 

CC agonists or partial agonists. In vitro and in vivo systems incorporating 

CC GPCRs are useful for elucidating and understanding the roles these 

CC receptors play in the human condition, both normal and diseased, as well 

CC as understanding the role of constitutive activation as it applies to 

CC understanding the signalling cascade. The cDNAs are useful for making a 

CC probe for dot-blot analysis against tissue mRNA and/or RT-PCR 

CC identification of the expression of the receptor in tissue samples. The 

CC present sequence represents a GPCR of the invention 

XX 

SQ Sequence 334 AA; 

Query Match 75.1%; Score 1238.5; DB 6; Length 334; 

Best Local Similarity 72.2%; Pred. No. 3.4e-117; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60 

II I : I : I I I I M I I I I I I M III: hill I I : I I : I : I I I I I I I : I I || 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

I I : I I I I I I II I : I I : I I I I I I I I I I I I I I M I I I I I I I I I I I M I I : I I I I : : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

I I M I II I I I I I I I I I | | | | : | | | | | | : | | : | | | | : | : | | : | | | | : | : | 
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 



Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 
I I I I : I I I I I I I I I | | MIMIII: :IM:|::| lllllhll l|::||||||:| 



Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 244 



Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

I I I I I = I I I : I I I I I I II | Ml I I I : I I I I I I I M I I I : I I M : I I I : I : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304 

QY 300 MLISKFRQYFKSLTSF 315 

I I : : : I I I I M I I 
Db 305 MLMNQLRHNFKSLTSF 320 






RESULT 7 
ABR41222 

ID ABR41222 standard; protein; 371 AA. 
XX 

AC ABR41222; 
XX 

DT 02-JUN-2003 (first entry) 
XX 

DE Human DITHP receptor. 
XX 

KW Human; dithp; diagnostic and therapeutic polynucleotide; diagnosis; 

KW cancer; cell proliferative disorder; autoimmune disorder; 

KW inflammatory disorder; infection; hormonal disorder; metabolic disorder; 
KW neurological disorder; gastrointestinal disorder; transport disorder; 
KW connective tissue disorder; drug screening; proteome analysis; 
KW gene therapy; antisense therapy; genotyping; transgenic animal; knock in; 

KW disease model; toxicological testing; transcript imaging; receptor. 
XX 

OS Homo sapiens. 
XX 

PN WO200297031-A2. 
XX 

PD 05-DEC-2002. 
XX 

PF 27-MAR-2002; 2002WO-US010056 . 
XX 

PR 28-MAR-2001; 2001US-0279619P. 
PR 29-MAR-2001; 2001US-0280067P. 
PR 29-MAR-2001; 2001US-0280068P. 
PR 16-MAY-2001; 2001US-0291280P. 
PR 17-MAY-2001; 2001US-0291829P. 
PR 17-MAY-2001; 2001US-0291849P. 
PR 19-JUN-2001; 2001US-0299428P. 
PR 20-JUN-2001; 2001US-0299776P. 
PR 20-JUN-2001; 2001US-0300001P. 
XX 

PA (INCY-) INCYTE GENOMICS INC. 
XX 

PI Daffo A, Jones AL, Tran AB, Dahl CR, Gietzen D, Chinn J; 

PI Dufour GE, Hillman JL, Yu JY, Tuason O, Yap PE, Amshey SR; 

PI Daughtery SC, Dam TC, Liu TF, Nguyen DA, Kleefeld Y, Gerstin EH; 

PI Peralta CH, David MH, Lewis SA, Chen AJ, Panzer SR, Harris B; 

PI Flores V, Marwaha R, Lo A, Lan RY, Urashka ME; 

XX 

DR WPI; 2003-129518/12. 

DR N-PSDB; ACC46165. 



XX 

PT Novel human diagnostic and therapeutic polypeptide useful for identifying 

PT test compound which specifically binds to a polypeptide encoded by human 

PT diagnostic and therapeutic polynucleotide, and to induce antibodies. 
XX 

PS Claim 27; SEQ ID NO 757; 591pp; English. 
XX 

CC The invention relates to novel human diagnostic and therapeutic 

CC polynucleotides designated dithp (ACC46080-ACC46749) and to their encoded 

CC proteins (DITHP; ABR41136-ABR41812). The invention also relates to 

CC polynucleotide sequences at least 90% identical to the dithp cDNA 

CC sequences of the invention; recombinant vectors, host cells and 

CC transgenic organisms comprising a dithp nucleic acid sequence; the 

CC recombinant production of DITHP proteins; antibodies specific for DITHP 

CC proteins; microarrays comprising dithp nucleic acid sequences; methods of 

CC detecting dithp nucleotide and protein sequences; methods of screening 

CC for compounds which specifically bind a DITHP protein; and methods of 

CC assessing the toxicity of test compounds using a dithp hybridisation 

CC probe. Dithp nucleic acid sequences and DITHP proteins may be used in the 

CC diagnosis of a wide variety of conditions including cancer and other cell 

CC proliferative disorders; autoimmune or inflammatory disorders; bacterial, 

CC viral, fungal or parasitic infections; hormonal disorders; metabolic 

CC disorders; neurological disorders; gastrointestinal disorders; transport 

CC disorders; and connective tissue disorders. They may also be used to 

CC screen for modulators of protein activity or gene expression. DITHP 

CC proteins can additionally be used in analysis of the proteome of a tissue 

CC or cell type and to induce antibodies. The dithp nucleic acids are 

CC additionally useful in somatic or germline gene therapy of the disorders 

CC mentioned above, as a source of antisense sequences, as a source of 

CC probes and primers, in genotyping and identification of individuals, in 

CC the generation of transgenic animal models of human disease or knock in 

CC humanised animals, in toxicological testing, and in transcript imaging. 

CC The present sequence represents a DITHP protein which has receptor 

CC activity. Note: The sequence data for this patent did not form part of 

CC the printed specification, but was obtained in electronic format directly 

CC from WIPO at ftp.wipo.int/pub/published_pct_sequences 

XX 

SQ Sequence 371 AA; 

Query Match 75.1%; Score 1238.5; DB 6; Length 371; 

Best Local Similarity 72.2%; Pred. No. 3.9e-117; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60 

II I : I : I I I I II I I I I I I || |||: |:||| I I : I I : I : I || | II I : I I II 
Db 42 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 101 

QY 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

ll:M I I I I I I I : I I : II II I I I I I II I I I I II I I I I I I I I I I I I || : I | II : : 

Db 102 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 161 

QY 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

I I I I I I I I I I I M I I I I II I : I I I I I I : I I : I III : I : I I : I I I I : I : I 
Db 162 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 221 



Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 
I I M : I I I I I I I I I I I MINIM: : t I I : I : : I , I I I : ||::||IMIM 



Db 222 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 281 



Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

I I I I I = I t I : I I I I I I I I I III I I I : I I I I I I I I I | | | : I M I : I I I : I : 
Db 282 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 341 

Qy 300 MLISKFRQYFKSLTSF 315 

I I : : : I I I I I I I I 
Db 342 MLMNQLRHNFKSLTSF 357 



RESULT 8 
AAE15633 

ID AAE15633 standard; protein; 379 AA. 
XX 
AC AAE15633; 
XX 
DT 12-MAR-2002 (first entry) 
XX 
DE Human G-protein coupled receptor-3 (GCREC-3) protein. 
XX 
KW Human; G-protein coupled receptor-3; GCREC-3; therapy; cancer; stroke; 
KW cell proliferative disorder; neurological; epilepsy; Parkinson's disease; 
KW Alzheimer's disease; inflammation; thyroiditis; haemolytic anaemia; AIDS; 
KW Acquired Immune Deficiency Syndrome; dementia; nootropic; cholelithiasis; 
KW multiple sclerosis; atherosclerosis; angina pectoris; gastroenteritis; 
KW diabetes; ulcer; viral infection; immunosuppressive. 
XX 
OS Homo sapiens. 
XX 
FH Key Location/Qualifiers 
FT Domain 187..206 
FT /label= Transmembrane_domain 
FT Domain 234..253 
FT /label= Transmembrane_domain 
FT Domain 276..296 
FT /label= Transmembrane_domain 
FT Domain 319..342 
FT /label= Transmembrane_domain 
XX 
PN WO200198351-A2. 
XX 
PD 27-DEC-2001. 
XX 
PF 15-JUN-2001; 2001WO-US019275. 
XX 
PR 16-JUN-2000; 2000US-0212483P. 
PR 22-JUN-2000; 2000US-0213954P. 
PR 29-JUN-2000; 2000US-0215209P. 
PR 07-JUL-2000; 2000US-0216595P. 
PR 14-JUL-2000; 2000US-0218936P. 
PR 19-JUL-2000; 2000US-0219154P. 
PR 21-JUL-2000; 2000US-0220141P. 
XX 
PA (INCY-) INCYTE GENOMICS INC. 
XX 
PI Lai P, Baughn MR, Hafalia AJA, Nguyen DB, Gandhi AR, Kallick DA; 



PI Griffin JA, Yue H, Khan FA, Patterson C, Lu DAM, Tribouley CM; 

PI Lu Y, Walia NK, Graul R, Yao MG, Yang J, Ramkumar J, Au-Young J; 

PI Elliott VS, Hernandez R, Walsh RT, Borowsky ML, Thornton M, He A; 
XX 

DR WPI; 2002-075627/10. 

DR N-PSDB; AAD24958. 
XX 

PT Isolated human G-protein coupled receptor polypeptides and the use of 

PT these sequences in the diagnosis, treatment and prevention of diseases 

PT and in the assessment of exogenous compounds on the expression of the 

PT receptors.
XX 

PS Claim 1; Page 115-116; 143pp; English. 
XX 

CC The invention relates to isolated human G-protein coupled receptor 

CC (GCREC) polypeptides and their biologically active fragments. GCREC and 

CC protein is useful in treating a disease or condition associated with an 

CC increase or decrease in expression of functional GCREC. The GCREC's are

CC useful in the diagnosis, treatment and prevention of cell proliferative 

CC disorders (cancer, leukaemia, melanoma); neurological disorders (stroke, 

CC epilepsy, Parkinson's disease, dementia, Alzheimer's disease); autoimmune 

CC inflammatory disorder (thyroiditis, haemolytic anaemia, AIDS, multiple 

CC sclerosis); cardiovascular disorder (atherosclerosis, angina pectoris), 

CC gastrointestinal disorder (ulcer, cholelithiasis, gastroenteritis), 

CC metabolic disorders (diabetes); viral infections (herpes virus) and in 

CC the assessment of the effects of exogenous compounds on the expression of 

CC the nucleic acid and amino acid sequences. The present sequence is human 

CC GCREC-3 protein 
XX 

SQ Sequence 379 AA; 

Query Match 75.1%; Score 1238.5; DB 5; Length 379; 

Best Local Similarity 72.2%; Pred. No. 4e-117; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

II I : I : I I I I II I I I I I I II III: I : I I I I I : I I : I : M I I I I I : I I I I 
Db 50 MAWNATCKNWLAAEAALEKYYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFN 109

QY 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

M : I I I I I I I II : I I : I I I I I I I I I II I M I I I I I II I I I I I I I I I I : I I I I : : 

Db 110 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 169 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

M I I I I I I I I I I I I I I I I I I : I I II I I : I I : I II I : I : I I : I I I ! : I : I 
Db 170 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 229 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

I I I I : I I I M I I I I I I I I I I I I I I : : I I I : I : : I I II II I : I I I I : : I II I I I : I 
Db 230 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 289 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

I I I I I : I I I : I I I I I I II I III I I I : I I I I I I I I | I I I : M | I : I M : I : 
Db 290 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 349 



Qy 300 MLISKFRQYFKSLTSF 315
II::: I II II I II
Db 350 MLMNQLRHNFKSLTSF 365
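
The "Best Local Similarity" figure printed with each alignment appears to be the number of exact matches divided by the total number of aligned columns (Matches + Conservative + Mismatches + Indels). That reading is an assumption inferred from the numbers in this report, not a documented GenCore formula. A minimal Python sketch (the function name is hypothetical):

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of aligned columns that are exact matches, rounded to 0.1.

    Assumption: every aligned column falls into exactly one of the four
    counts printed in the report.
    """
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Counts taken from the RESULT 8 statistics line.
print(best_local_similarity(228, 42, 45, 1))
```

The same function reproduces the values printed for the other hits in this listing, for example 219/40/42/1 and 188/33/36/1.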



RESULT 9 
ABG72131 

ID ABG72131 standard; protein; 334 AA. 
XX 

AC ABG72131; 
XX 

DT 30-JAN-2003 (first entry) 
XX 

DE Human adenosine receptor. 
XX 

KW Human; mammalian; adenosine receptor; G-protein coupled receptor; GPCR; 

KW adenosine-mediated medical condition; vasodilation; hypotension; 

KW reversal of tachycardia; chronic renal disease; thyroid disorder; 

KW inflammation; asthma; hypertensive; antiarrhythmic; antiinflammatory; 

KW receptor . 

XX 

OS Homo sapiens. 
XX 

PN US2002137887-A1. 
XX 

PD 26-SEP-2002. 
XX 

PF 17-JAN-2001; 2001US-00765034 . 
XX 

PR 17-JAN-2001; 2001US-00765034 . 
XX 

PA (HEDR/) HEDRICK J A. 

PA (LACH/) LACHOWICZ J E. 

PA (WANG/) WANG W. 

PA (GUST/) GUSTAFSON E L. 

XX 

PI Hedrick JA, Lachowicz JE, Wang W, Gustafson EL; 
XX 

DR WPI; 2003-074992/07. 

DR N-PSDB; ABS57291. 
XX 

PT Novel isolated mammalian adenosine receptor polypeptide useful for 

PT identifying an agonist or antagonist of the receptor for treating 

PT vasodilation, hypotension, chronic renal diseases, thyroid disorders and 

PT inflammation. 

XX 

PS Claim 2; Page 16-17; 19pp; English. 
XX 

CC The present invention relates to the isolation of a mammalian (human) 

CC adenosine receptor, and the polynucleotide sequence encoding it. The 

CC cloned receptor resembles a member of the G-protein coupled receptor 

CC (GPCR) super family that contains 7-transmembrane domains. The adenosine 

CC receptor is useful for identifying agonists and antagonists of the 

CC receptor, which may be useful for treating an adenosine-mediated medical 

CC condition. The adenosine receptor polypeptide sequence is also useful as 

CC an antigen to elicit antibody production in an immunologically competent 

CC host. An antibody which binds specifically to the adenosine receptor is 

CC useful for treating medical conditions caused or mediated by adenosine 

CC such as vasodilation, hypotension, reversal of tachycardia, chronic renal 



CC diseases, thyroid disorders and inflammation (e.g. asthma). The antibody
CC can also be used to purify the adenosine receptor, or as a basis for
CC immunoassays of the receptor. The polynucleotide sequence encoding the
CC adenosine receptor is useful for producing vectors and host cells
CC containing the vectors. It is also useful for measuring expression of a
CC mammalian adenosine receptor gene in a biological sample. The present
CC sequence represents human adenosine receptor
XX

SQ Sequence 334 AA;

Query Match 74.6%; Score 1231.5; DB 6; Length 334; 

Best Local Similarity 71.8%; Pred. No. 1.8e-116; 

Matches 227; Conservative 42; Mismatches 46; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

I I I = I : I I I I II I II I I I II III: I : I I I | I : | | : | : | || | I I I : M I I 
Db 5 MAWNATCKNWKAAEAALEKYYLSIFYGIEFVVGv^ 64 

QY 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

IIMl I I II I I I : I I : I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I : I I I I : : 

Db 65 LSVSDLAFLCTLPMLIRSYT^GNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

I I M I I I I I I I II I I I I I II : I 11111:11:1 I I I : | : I I : I I I I : | : | 
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAWIFSIL 240 

I I I I : I M M II I I I I MINIM: : I II : I : : I M I II I : II I I : : M I II I : 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVP 244 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

M M I M II : II I II I II I I I I I | | : | | | M II I I I II : II II : I II : I : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304 

Qy 300 MLISKFRQYFKSLTSF 315

I I ::: I II M I II
Db 305 MLMNQLRHNFKSLTSF 320



RESULT 10 
AAW19854 

ID AAW19854 standard; protein; 334 AA. 
XX 



AC AAW19854; 
XX 

DT 11-SEP-1997 (first entry)
XX 

DE Human purinergic receptor P2U2.
XX 

KW P2U2 receptor; purinergic receptor; diagnosis; therapy. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers
FT Domain 1..23
FT        /label= N-terminal_domain
FT Domain 24..49
FT        /label= TMI
FT        /note= "transmembrane domain I"
FT Domain 50..60
FT        /label= ICDI
FT        /note= "intracellular domain I"
FT Domain 61..82
FT        /label= TMII
FT        /note= "transmembrane domain II"
FT Domain 83..99
FT        /label= ECDI
FT        /note= "extracellular domain I"
FT Domain 100..119
FT        /label= TMIII
FT        /note= "transmembrane domain III"
FT Domain 120..141
FT        /label= ICDII
FT        /note= "intracellular domain II"
FT Domain 142..161
FT        /label= TMDIV
FT        /note= "transmembrane domain IV"
FT Domain 162..183
FT        /label= ECDII
FT        /note= "extracellular domain II"
FT Domain 184..207
FT        /label= TMDV
FT        /note= "transmembrane domain V"
FT Domain 208..233
FT        /label= ICDIII
FT        /note= "intracellular domain III"
FT Domain 234..256
FT        /label= TMDVI
FT        /note= "transmembrane domain VI"
FT Domain 257..276
FT        /label= ECDIII
FT        /note= "extracellular domain III"
FT Domain 277..300
FT        /label= TMDVII
FT        /note= "transmembrane domain VII"
FT Domain 301..334
FT        /label= C-terminal_domain
XX 



PN WO9720045-A2. 
XX 

PD 05-JUN-1997. 
XX 

PF 08-NOV-1996; 96WO-US018175.
XX 

PR 15-NOV-1995; 95US-0006782P . 

PR 15-NOV-1995; 95US-00559524 . 
XX 

PA (CORT-) COR THERAPEUTICS INC. 
XX 

PI Conley PB, Jantzen H; 
XX 

DR WPI; 1997-310601/28. 

DR N-PSDB; AAT71900. 
XX 



PT New isolated purinergic receptor sub-type - used to develop products for 

PT diagnosis and therapy, e.g. for screening for agonists and antagonists 

PT which can modulate activation. 
XX 

PS Claim 1; Fig 1A-B; 36pp; English. 
XX 

CC P2U2 receptor (AAW19854) is a novel human purinergic receptor subtype 

CC that is abundantly expressed in kidney and in many cell lines of 

CC megakaryocytic or erythroleukaemic origin and which is activated by ATP, 

CC UDP, UTP and UDP . Its amino acid sequence was deduced from a cDNA clone 

CC derived from DAMI (ATCC CRL 9792) cells. P2U2 and its polypeptides can be 

CC expressed in host cells and used to develop diagnostic and therapeutic 

CC agents. Antagonists and agonists based on the extracellular domains of 

CC P2U2 receptor, or which affect receptor function by binding to one of the 

CC intracellular domains, can be used to treat diseases caused by aberrant 

CC activation of this receptor or to treat diseases whose symptoms can be 

CC ameliorated by stimulating or inhibiting the activity of the receptor 
XX 

SQ Sequence 334 AA; 



Query Match 74.3%; Score 1226.5; DB 2; Length 334; 

Best Local Similarity 71.5%; Pred. No. 5.7e-116; 

Matches 226; Conservative 42; Mismatches 47; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

I ! I : I : I I I I II I I I I I I II III: I : I I I I I : II : I : I I M I I I : I I I I 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIWYGYIFSLKNWNSSNIYLFN 64

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

M : II I I I I I I I : I I : I I I I I I I I II II I I I I I I I I I I I II I II I I I : I I I I : : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

I M I I I I I I I I I I I I I I I I I : I I II I I : I I : I , I I : I : | i : I I I I : I : | 
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

I I I I :l I I I II I I I I I llllllll: :M|:|::| IIIM|:M IhMllllhl 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 244 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

I I M I : I I I : I I I I I I M I III I I I : I I I I I I I I I I : I I I I : II I : I : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRALGFLNSVINPVFYFLLGDHFRD 304 

Qy 300 MLISKFRQYFKSLTSF 315 

I I : : : I I I I I I I I 
Db 305 MLMNQLRHNFKSLTSF 320 



RESULT 11 
ABU63309 

ID ABU63309 standard; protein; 334 AA. 
XX 

AC ABU63309; 
XX 

DT 18-SEP-2003 (first entry) 
XX 



DE Human ATP receptor. 
XX 

KW Human; receptor; ATP receptor; G-protein coupled receptor; gene therapy; 

KW 7-transmembrane receptor; asthma; allergic rhinitis; hypertension; ulcer; 

KW angina pectoris; allergy; psychosis; depression; migraine; vomiting; 

KW benign prostatic hypertrophy; arterial thrombosis; myocardial infarction; 

KW urinary retention; angioplasty; cystic fibrosis; Parkinson's disease;

KW acute heart failure; hypotension; thrombolysis; osteoporosis. 

XX 

OS Homo sapiens. 
XX 

PN US2003054487-A1. 
XX 

PD 20-MAR-2003. 
XX 

PF 16-OCT-2002; 2002US-00270587 . 
XX 

PR 11-JAN-1996; 96US-0009902P.

PR 10-JAN-1997; 97US-00781456. 

PR 20-JUL-2001; 2001US-00908593 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Li Y; 
XX 

DR WPI; 2003-540615/51. 

DR N-PSDB; ACD27619. 
XX 

PT New polynucleotide, useful for producing a medicament for treating 

PT asthma, allergic rhinitis or hypertension. 

XX 

PS Claim 1; Fig 1; 24pp; English. 
XX 

CC The invention relates to an isolated polynucleotide encoding a G-protein 

CC coupled, 7-transmembrane ATP receptor. The polynucleotide is useful for 

CC producing a medicament for treating asthma, allergic rhinitis or 

CC hypertension. Antagonists for the ATP receptor can be used to treat

CC angina pectoris, ulcers, allergies, psychoses, depression, migraine, 

CC vomiting, benign prostatic hypertrophy, arterial thrombosis, myocardial 

CC infarction, thrombolysis, angioplasty, cystic fibrosis. Agonists of the 

CC ATP receptor can be used to treat Parkinson's disease, acute heart

CC failure, hypotension, urinary retention and osteoporosis. The present 

CC sequence represents the amino acid sequence of the human ATP receptor 
XX 

SQ Sequence 334 AA; 



Query Match 74.2%; Score 1224.5; DB 6; Length 334; 

Best Local Similarity 71.5%; Pred. No. 9e-116; 

Matches 226; Conservative 43; Mismatches 46; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

M I : I : I I I I M I I I I I I II Mh hill I I : I I : I : I I M I I I : I I I I 
Db 5 MAWNATCKNWL/VAEAALEKYYLSIFYGIEFWGVLGNTIVA^GYIFSLKNWNSSNIYLra 64 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120

Ihll I I I I I I I : I I : I I II I I I II I I I I I I I I I I I I I I I I I I I I I I : I I I I : : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 



Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180
1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 : 1 1 : 1 II 1 : 1 : 1 1 : 1 1 1 1 : | : |
Db 125 KYPFREHLLQKKECAILISLAMWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240
M 1 1 : 1 1 1 1 1 1 1 1 1 1 1 I 1 I I I I I I : : | M : | : : | 1 1 I I I | : | | | | : : | | | | | | : |
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 244

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299
1 M 1 1 : 1 1 1 : 1 1 1 1 1 1 II 1 III 1 1 1 : 1 1 1 : 1 1 I I I 1 1 1 : 1 1 1 1 : 1 1 1 : 1 :
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPVAFLNSVINPVFYFLVGDHFRD 304

Qy 300 MLISKFRQYFKSLTSF 315
II::: 1 1 1 1 1 II 1
Db 305 MLMNQLRHNFKSLTSF 320





RESULT 12 
AAW22732 



ID AAW22732 standard; protein; 334 AA. 
XX 

AC AAW22732; 
XX 

DT 07-OCT-1997 (first entry) 
XX 

DE Human ATP receptor. 
XX 

KW ATP receptor; G-protein coupled receptor; agonist; antagonist. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers

FT Misc-difference 212 

FT /note= "encoded by TCC"

FT Misc-difference 235 

FT /note= "encoded by TCG" 

FT Misc-difference 244 

FT /label= Unknown 

FT /note= "encoded by CYT" 

XX 

PN WO9724929-A1.
XX 

PD 17-JUL-1997. 
XX 

PF 11-JAN-1996; 96WO-US000392.
XX

PR 11-JAN-1996; 96WO-US000392.



XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Li Y; 
XX 

DR WPI; 1997-372505/34. 

DR N-PSDB; AAT75146. 
XX 

PT Isolated human ATP receptor - agonists and antagonists of which are 



PT useful in treatment of, e.g. asthma, hypertension, arterial thrombosis 

PT and psychotic and neurological disorders. 

XX 

PS Claim 15; Fig 1A-C; 53pp; English. 
XX 

CC Human ATP receptor (AAW22732) is structurally related to the G protein- 

CC coupled receptor family. It shows 29.8% identity to a murine P2u 

CC receptor. Its amino acid sequence was deduced from a human placental cDNA 

CC clone (AAT75146) . Recombinant ATP receptor can be expressed in bacterial 

CC (e.g. E. coli) , mammalian (e.g. COS) or insect (e.g. Sf9) host cells and 

CC used to screen for agonists and antagonists useful in the treatment of 

CC conditions related to underexpression of the receptor (e.g. asthma, 

CC Parkinson's disease, acute heart failure, hypotension, urinary retention

CC and osteoporosis) or overexpression of the receptor (e.g. arterial

CC thrombosis, hypertension, thrombolysis, angioplasty, cystic fibrosis, 

CC ulcers, asthma, allergy, benign prostatic hypertrophy, psychotic and 

CC neurological disorders, dyskinesias, endogenous anorexia and bulimia) 

XX 

SQ Sequence 334 AA; 

Query Match 73.9%; Score 1219.5; DB 2; Length 334; 

Best Local Similarity 71.2%; Pred. No. 2.9e-115; 

Matches 225; Conservative 43; Mismatches 47; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

II I : I : I I I I II I I I I I I II III: hill I I : I I : I : I I I I I I I : I I I I 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

I I : I I I I I I I I I : I I : I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I : | | | | : : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

I I I I I II I I I I I I I II I I I : I I I I I I : I I : I III : | : | | : | | | | : | : | 
Db 125 KYPFREHLLQKKECAILISLAMWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

M I I : I I I I I I I I I I I I I I I I I I I : : I I I : I : : I I I I I I I : II I I : : I I I I I I : 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVX 244 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

I I I I I : I I I : I I I I I I M I III I I I : I I I : I I I I I I I I : I I I I : I II : I : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPVAFLNSVINPVFYFLVGDHFRD 304 

Qy 300 MLISKFRQYFKSLTSF 315 

I I : : : I I I I I I I I 
Db 305 MLMNQLRHNFKSLTSF 320 
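
Because many alignment rows in this listing carry stray spaces inside the residue string or an upper-case `QY` label, a tolerant parser helps when re-reading the blocks by machine. The sketch below is only an illustration under stated assumptions: `parse_alignment_line` is a hypothetical helper, and it assumes the row layout `label, start, residues, end` seen in the Qy/Db rows above.

```python
import re

# label, start coordinate, residue text (may contain spaces), end coordinate
_ALIGN_LINE = re.compile(r"^\s*(Qy|Db)\s+(\d+)\s+(.+?)\s+(\d+)\s*$", re.IGNORECASE)


def parse_alignment_line(line):
    """Return (label, start, residues, end) for a Qy/Db row, else None.

    Whitespace inside the residue string is treated as scanning noise and
    removed; match lines (runs of I, |, :) are not Qy/Db rows and give None.
    """
    m = _ALIGN_LINE.match(line)
    if m is None:
        return None
    label, start, residues, end = m.groups()
    return label.capitalize(), int(start), "".join(residues.split()), int(end)


print(parse_alignment_line("Db 305 MLMNQLRHN FKS LT S F 320"))
```

A coordinate that has itself been split by a space (for example `2 82`) cannot be told apart from residue text by this pattern and would need to be repaired first.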



RESULT 13 
AAU31029 

ID AAU31029 standard; protein; 387 AA. 
XX 

AC AAU31029; 
XX 

DT 18-DEC-2001 (first entry) 
XX 



DE Novel human secreted protein #1520. 
XX 

KW Human; vaccination; gene therapy; nutritional supplement; 

KW stem cell proliferation; haematopoiesis; nerve tissue regeneration;

KW immune suppression; immune stimulation; anti-inflammatory; leukaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO200179449-A2. 
XX 

PD 25-OCT-2001. 
XX 

PF 16-APR-2001; 2001WO-US008656.
XX

PR 18-APR-2000; 2000US-00552929.

PR 26-JAN-2001; 2001US-00770160 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Drmanac RT; 
XX 

DR WPI; 2001-611725/70. 
XX 

PT Nucleic acids encoding a range of human polypeptides, useful in genetic 

PT vaccination, testing and therapy. 

XX 

PS Claim 20; Page 392; 765pp; English. 
XX 

CC The invention relates to novel human secreted polypeptides. The 

CC polypeptides and antibodies to the polypeptides are useful for 

CC determining the presence of or predisposition to a disease associated 

CC with altered levels of polypeptide. The polypeptides are also useful for 

CC identifying agents (agonists and antagonists) that bind to them. Cells 

CC expressing the proteins are useful for identifying a therapeutic agent 

CC for use in treatment of a pathology related to aberrant expression or 

CC physiological interactions of the polypeptide. Vectors comprising the 

CC nucleic acids encoding the polypeptides and cells genetically engineered 

CC to express them are also useful for producing the proteins. The proteins 

CC are useful in genetic vaccination, testing and therapy, and can be used 

CC as nutritional supplements. They may be used to increase stem cell 

CC proliferation; to regulate haematopoiesis; and in bone, cartilage, tendon 

CC and/or nerve tissue growth or regeneration; immune suppression and/or 

CC stimulation; as anti-inflammatory agents; and in treatment of leukaemias. 

CC AAU29510-AAU33304 represent the amino acid sequences of novel human

CC secreted proteins of the invention 

XX 

SQ Sequence 387 AA; 

Query Match 73.7%; Score 1216.5; DB 4; Length 387; 

Best Local Similarity 71.1%; Pred. No. 7.2e-115; 

Matches 224; Conservative 43; Mismatches 47; Indels 1; Gaps 1; 

Qy 2 AQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNL 61

I I : I : : I I I II I I I I I I II III: hill I I : II : I : I I I I I I I : I I I I I 
Db 59 AWNATCKHWLAAEAALEKYYLSIFYGIEFWGVLGNTIWTGYIFSLKNWNSSNIYLFNL 118 



Qy 62 SISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMK 121

I : II II I I I I I : I I : I M I I I I I I I I II I I I I I I I I II I I I I I II I : | | | | : : |

Db 119 SVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLIIK 178 

Qy 122 YPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNL 181 

I I I I I I I I I I I I I I I I I I I : I I I I I I : I I : I II I : 1 : I I : I I I I : I : I I 
Db 179 YPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYNL 238 

Qy 182 IYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSILF 241

111:1111111 III III lllh : I I I : I : : I I II I I I : I I I I : : I I I I I | : | | 
Db 239 IYSMCLTLLGFSIPLFVMCLFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVLF 298

Qy 242 TPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREM 300 

I I I I : I I I : I I I I I I II I III I I I : I I I I I I I I I I I I : I I I I : I II : I : I 
Db 299 TPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRDM 358 

Qy 301 LISKFRQYFKSLTSF 315 

I : : : I I I I I I I I 
Db 359 LMNQLRHNFKSLTSF 373 



RESULT 14 
ADC12680 

ID ADC12680 standard; protein; 335 AA. 
XX 

AC ADC12680;
XX 

DT 18-DEC-2003 (first entry) 
XX 

DE Human GPCR protein, SEQ ID No 12. 
XX 

KW G protein-coupled receptor; GPCR; antibacterial; fungicide; protozoacide; 

KW virucide; antirheumatic; antiarthritic; tranquiliser; antidiabetic;

KW osteopathic; nootropic; neuroprotective; anorectic; cardiant; 

KW neuroleptic; cytostatic; antiparkinsonian; hypotensive; hypertensive; 

KW antiulcer; antiallergic; anticonvulsant; analgesic; infection; 

KW rheumatoid arthritis; chronic obstructive pulmonary diseases; COPD; 

KW asthma; non-insulin dependent diabetes; obesity; osteoporosis; 

KW Alzheimer's disease; age-related macular degeneration;

KW myocardial infarction; schizophrenia; osteoarthritis; cancer; 

KW Parkinson's disease; congestive heart failure; hypotension; hypertension; 

KW ulcer; allergy; benign prostatic hyperplasia; seizure disorder; anxiety; 

KW obsessive compulsive disorder; Cushing's syndrome; hypopituitarism; pain; 

KW human . 

XX 

OS Homo sapiens . 
XX 

PN WO2003000893-A2 . 
XX 

PD 03-JAN-2003. 
XX 

PF 24-JUN-2002; 2002WO-IB002357 . 
XX 

PR 26-JUN-2001; 2001US-0301095P.

PR 06-NOV-2001; 2001US-0333185P.
XX 

PA (DECO-) DECODE GENETICS EHF. 
XX 



PI Martinez RMA, Sigurdsson GT; 
XX 

DR WPI; 2003-210155/20. 

DR N-PSDB; ADC12679. 
XX 

PT New G protein-coupled receptor (GPCR) genes and polypeptides, useful for 

PT diagnosing diseases associated with a GPCR, or in gene therapy for 

PT treating e.g. obesity, osteoporosis, Alzheimer's, cancers or congestive 

PT heart failure. 

XX 

PS Claim 10; SEQ ID NO 12; 253pp; English. 
XX 

CC The invention relates to a novel isolated nucleic acid of a G protein- 

CC coupled receptor (GPCR) gene comprising any of 62 sequences of 912-2454 

CC bp, or its complements; a GPCR polypeptide comprising any of 62 sequences 

CC of 291-818 amino acids; or a nucleic acid that hybridises, under high 

CC stringency conditions, with any of the 62 GPCR sequences or any of their 

CC complements. The GPCR agents of the invention have the following 

CC activities: antibacterial, fungicide, protozoacide, virucide, 

CC antirheumatic, tranquiliser, antiarthritic, antidiabetic, osteopathic,

CC nootropic, neuroprotective, anorectic, cardiant, neuroleptic, cytostatic, 

CC antiparkinsonian, hypotensive, hypertensive, antiulcer, antiallergic, 

CC anticonvulsant, and analgesic. The GPCR therapeutic agent, particularly a 

CC GPCR gene agonist or antagonist, is useful for treating a disease or 

CC condition associated with a GPCR in an individual. The nucleic acid cited 

CC above, which is 100 or fewer nucleotides in length, is useful for 

CC assaying a sample for the presence of the GPCR gene nucleic acid or a 

CC GPCR gene nucleic acid with at least one nucleotide difference from a 

CC first nucleic acid, or for diagnosing a susceptibility to a disease or 

CC conditions associated with a GPCR. These diseases include infections 

CC (e.g. bacterial, fungal, protozoan or viral), rheumatoid arthritis, 

CC chronic obstructive pulmonary diseases (COPD), asthma, non-insulin

CC dependent diabetes, obesity, osteoporosis, Alzheimer's disease, age- 

CC related macular degeneration, myocardial infarction, schizophrenia, 

CC osteoarthritis, cancers, Parkinson's diseases, congestive heart failure, 

CC hypotension, hypertension, ulcers, allergies, benign prostatic 

CC hyperplasia, seizure disorder, anxiety, obsessive compulsive disorder, 

CC Cushing's syndrome, hypopituitarism, or pain. This sequence represents 

CC one of the 62 GPCR proteins of the invention. 

XX 

SQ Sequence 335 AA; 

Query Match 71.8%; Score 1185.5; DB 7; Length 335; 

Best Local Similarity 72.5%; Pred. No. 8.4e-112; 

Matches 219; Conservative 40; Mismatches 42; Indels 1; Gaps 1; 

Qy 15 AILNKYYLSAFYAIEFIFGLLGNVTWEGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPI 74 

I I Mill II III: hill I I : I I : I : I I I 1 I I I : 1 I i I I I : II I I I I I I I : 
Db 1 AALEKYYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFNLSVSDLAFLCTLPM 60 

Qy 75 LIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEF 134 

I I : I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I : I I I I :: I I I I II I I I I I I I 

Db 61 LIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLIIKYPFREHLLQKKEF 120 

Qy 135 AILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLI 194 

I I I I I I I : I I I I I I : I I : I II I : I : I I : I I M : | : I I I I I : II I I I I I I I 
Db 121 AILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYNLIYSMCLTLLGFLI 180 



Qy 195 PLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIAS 254 

II IIIIIMI: :IM:|::| lllllhll I I :: I I I I I I : I | | | | I : I II : I I I I 
Db 181 PLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVLFTPYHVMRNVRIAS 240 

Qy 255 RLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSLT 313 

M II I III 111:111111111 ll|:||||:|ll:|:||::: I Mill 
Db 241 RLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRDMLMNQLRHNFKSLT 300 

Qy 314 SF 315 

I I 

Db 301 SF 302 



RESULT 15 
AAB45376 

ID AAB45376 standard; protein; 258 AA. 
XX 

AC AAB45376; 
XX 

DT 14-FEB-2001 (first entry) 
XX 

DE Human secreted protein sequence encoded by gene 37 SEQ ID NO: 128.
XX 

KW Human; secreted protein; diagnosis; immunosuppressive; antiarthritic; 

KW antirheumatic; antiproliferative; cytostatic; cardiant; vasotropic;

KW cerebroprotective; nootropic; neuroprotective; antibacterial; virucide; 

KW fungicide; ophthalmological; vulnerary; gene therapy; autoimmune disease;

KW hyperproliferative disorder; cardiovascular disorder; angiogenesis;

KW cerebrovascular disorder; nervous system disorder; infection; skin aging; 

KW ocular disorder; wound healing; food additive; preservative. 

XX 

OS Homo sapiens. 
XX 

PN WO200061628-A1. 
XX 

PD 19-OCT-2000. 
XX 

PF 06-APR-2000; 2000WO-US009070 . 
XX 

PR 09-APR-1999; 99US-0128695P.

PR 14-JAN-2000; 2000US-0176052P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Ruben SM, Komatsoulis G; 
XX 

DR WPI; 2000-619228/59. 
XX 

PT New nucleic acid molecules encoding 49 human secreted proteins for

PT diagnosing, preventing, treating or ameliorating medical conditions and

PT used as food additives or preservatives.

XX 

PS Disclosure; Page 447-448; 454pp; English. 
XX 

CC The polynucleotide sequences given in AAC81086 to AAC81134 encode the 

CC human secreted proteins given in AAB45308 to AAB45356. AAB45357 to 



CC AAB45384 represent human secreted polypeptide sequences and proteins 

CC homologous to them, which are given in the exemplification of the present 

CC invention. Human secreted proteins have activities based on the tissues 

CC and cells the genes are expressed in. Examples of activities include: 

CC antiarthritic; immunosuppressive; antirheumatic; antiproliferative; 

CC cytostatic; cardiant; vasotropic; cerebroprotective; nootropic; 

CC neuroprotective; antibacterial; virucide; fungicide; ophthalmological;

CC and vulnerary. The polynucleotides and polypeptides can be used to 

CC prevent, treat or ameliorate a medical condition in e.g. humans, mice, 

CC rabbits, goats, horses, cats, dogs, chickens or sheep. They are also used 

CC in diagnosing a pathological condition or susceptibility to a 

CC pathological condition. Disorders which are diagnosed or treated include 

CC autoimmune diseases, hyperproliferative disorders, cardiovascular

CC disorders, cerebrovascular disorders, angiogenesis, nervous system

CC disorders, infections caused by bacteria, viruses and fungi and ocular 

CC disorders. The polypeptides can also be used to aid wound healing and 

CC epithelial cell proliferation, to prevent skin aging due to sunburn, to 

CC maintain organs before transplantation, for supporting cell culture of 

CC primary tissues, to regenerate tissues and in chemotaxis. The

CC polypeptides can also be used as a food additive or preservative to 

CC increase or decrease storage capabilities, fat content, lipid, protein, 

CC carbohydrate, vitamins, minerals, cofactors and other nutritional

CC components. AAC81077 to AAC81085 and AAB45307 represent sequences used in 

CC the exemplification of the present invention 

XX 

SQ Sequence 258 AA; 

Query Match 61.5%; Score 1014.5; DB 3; Length 258; 

Best Local Similarity 72.9%; Pred. No. 1.5e-94; 

Matches 188; Conservative 33; Mismatches 36; Indels 1; Gaps 1; 

Qy 20 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 

I I I I M Ml: hill I I : I I : I : I I II I I I : I I I I I I : I I II I I I II : I I : I I 
Db 1 YYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFNLSVSDLAFLCTLPMLIRSY 60 

Qy 80 ANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILIS 139

M I I I I I I I I I I I I I! I I I I I I I I I I I I I : I I I! :: I I I I I I | I Ml I I I I I I I 

Db 61 ANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLIIKYPFREHLLQKKEFAILIS 120 

Qy 140 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199 

I I : I I M I I : I I : I II I : I : I I : I I I I : I : I I I I I : I I I I I I I I I M II 
Db 121 LAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYNLIYSMCLTLLGFLIPLFVM 180 

Qy 200 CFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIASRLDSW 259 

I I M I I : : II I : I : : I I I I II I : I I I I :: I I I I I I : M I I I I : I I I : I I I I I I I I 
Db 181 CFFYYKIALFLKQRNRQVATAiPLEKPLNLVIMAWIFSVLFTPYHVMRNVRIASRLGSW 240 

Qy 260 PQ-GCTQKAIKSIYTLTR 276 

I III I I I : I I 
Db 241 KQYQCTQWINSFYIVTR 258



Search completed: August 23, 2004, 17:00:22 
Job time : 132 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: August 23, 2004, 16:58:14 ; Search time 31 Seconds 

(without alignments) 
527.917 Million cell updates/sec 

Title: US-09-891-138A-2

Perfect score: 1650 

Sequence: 1 MAQNLSCENWLATEAI LNKY REMLISKFRQYFKSLTSFRT 317 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 389414 seqs, 51625971 residues 

Total number of hits satisfying chosen parameters: 389414 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
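
The parameters above describe a Smith-Waterman ("sw model") local alignment with an affine gap penalty. The following is a minimal sketch of that kind of scoring, not the GenCore implementation. It assumes that opening a gap costs Gapop and each further gap position costs Gapext, and it uses a hypothetical `toy_score` function (+5 for identity, -4 otherwise) in place of the BLOSUM62 table, so its scores are not comparable to those in this report.

```python
def smith_waterman(a, b, score, gap_open=10.0, gap_ext=0.5):
    """Best local alignment score with affine gaps (Gotoh recurrences)."""
    neg = float("-inf")
    n, m = len(a), len(b)
    # H: best local score of an alignment ending at (i, j)
    # E: best score ending with a residue of b aligned to a gap
    # F: best score ending with a residue of a aligned to a gap
    H = [[0.0] * (m + 1) for _ in range(n + 1)]
    E = [[neg] * (m + 1) for _ in range(n + 1)]
    F = [[neg] * (m + 1) for _ in range(n + 1)]
    best = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            E[i][j] = max(H[i][j - 1] - gap_open, E[i][j - 1] - gap_ext)
            F[i][j] = max(H[i - 1][j] - gap_open, F[i - 1][j] - gap_ext)
            diag = H[i - 1][j - 1] + score(a[i - 1], b[j - 1])
            # A local alignment may restart anywhere, hence the floor at 0.
            H[i][j] = max(0.0, diag, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


def toy_score(x, y):
    """Hypothetical stand-in for a substitution matrix such as BLOSUM62."""
    return 5.0 if x == y else -4.0


print(smith_waterman("ACDEFGHIKL", "ACDEFHIKL", toy_score))
```

With a real substitution matrix, `score` would look up the residue pair in that matrix instead.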



Database : Issued_Patents_AA:* 
1: /cgn2_6/ptodata/2/iaa/5A_COMB.pep:* 
2: /cgn2_6/ptodata/2/iaa/5B_COMB.pep:* 
3: /cgn2_6/ptodata/2/iaa/6A_COMB.pep:* 
4: /cgn2_6/ptodata/2/iaa/6B_COMB.pep:* 
5: /cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep:* 
6: /cgn2_6/ptodata/2/iaa/backfiles1.pep:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
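
The "Query Match" percentages in this report appear to be the hit score expressed as a percentage of the perfect score (1650 for this query). That reading is inferred from the printed numbers, not stated by the program, and the function name below is an illustrative assumption.

```python
def query_match_percent(score, perfect_score):
    """Score as a percentage of the query's perfect (self-match) score."""
    return round(100.0 * score / perfect_score, 1)


# Values taken from this report: perfect score 1650.
for score in (1226.5, 474, 353):
    print(score, query_match_percent(score, 1650))
```

The same relation holds for the preceding search, whose hit with score 1014.5 is listed at 61.5%.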



SUMMARIES 



Result             Query
No.   Score        Match        Length       DB           ID                   Description

1     1226.5       74.3         334          2            US-08-559-524A-2     Sequence 2, Appli
2     1226.5       74.3         334          3            US-08-749-707-2      Sequence 2, Appli
3     1226.5       74.3         334          4            US-09-947-922-2      Sequence 2, Appli
4     474          28.7         373          2            US-08-559-524A-4     Sequence 4, Appli
5     474          28.7         373          3            US-08-749-707-4      Sequence 4, Appli
6     474          28.7         373          4            US-09-947-922-4      Sequence 4, Appli
7     473.5        28.7         362          3            US-08-513-974B-374   Sequence 374, App
8     372          22.5         374          4            US-09-102-710B-3     Sequence 3, Appli
9     370.5        22.5         373          3            US-08-513-974B-373   Sequence 373, App
10    353          21.4         355          1            US-08-153-848-28     Sequence 28, Appl
11    353          21.4         355          1            US-08-153-848-32     Sequence 32, Appl
12    353          21.4         [illegible]  [illegible]  [illegible]          Sequence 28, Appl
13    353          21.4         [illegible]  [illegible]  [illegible]          Sequence [illegible], Appl
14    353          21.4         355          4            [illegible]          Sequence [illegible], App
15    353          [illegible]  [illegible]  4            [illegible]          Sequence 32, Appl
16    353          21.4         [illegible]  4            [illegible]          Sequence [illegible], App
17    353          [illegible]  [illegible]  4            [illegible]          Sequence [illegible], App
18    353          21.4         [illegible]  [illegible]  [illegible]          Sequence [illegible], App
19    353          [illegible]  [illegible]  [illegible]  [illegible]          Sequence 32, Appl
20    338          20.5         [illegible]  [illegible]  [illegible]          Sequence [illegible], Appl
21    [illegible]  [illegible]  [illegible]  [illegible]  [illegible]          Sequence 371, App
22    [illegible]  [illegible]  [illegible]  [illegible]  [illegible]          Sequence 39, Appl
23    336.5        20.4         [illegible]  [illegible]  [illegible]          Sequence 20, Appl
24    336.5        20.4         [illegible]  [illegible]  [illegible]          Sequence 2, Appli
25    [illegible]  [illegible]  [illegible]  [illegible]  [illegible]          Sequence 55, Appl
26    336.5        [illegible]  [illegible]  4            [illegible]          Sequence 55, Appl
27    [illegible]  [illegible]  [illegible]  4            [illegible]          Sequence 34, Appl
28    335.5        [illegible]  [illegible]  [illegible]  [illegible]          Sequence 2, Appli
29    335.5        [illegible]  [illegible]  [illegible]  [illegible]          Sequence 2, Appli
30    [illegible]  [illegible]  [illegible]  [illegible]  [illegible]          Sequence 2, Appli
31    [illegible]  [illegible]  [illegible]  [illegible]  [illegible]          Sequence 3, Appli
32    [illegible]  [illegible]  [illegible]  [illegible]  [illegible]          Sequence 3, Appli
33    [illegible]  [illegible]  [illegible]  4            [illegible]          Sequence 3, Appli
34    [illegible]  [illegible]  [illegible]  [illegible]  [illegible]          Sequence 53, Appl
35    [illegible]  [illegible]  [illegible]  4            [illegible]          Sequence 53, Appl
36    [illegible]  [illegible]  [illegible]  [illegible]  [illegible]          Sequence 2, Appli
37    [illegible]  [illegible]  [illegible]  [illegible]  [illegible]          Sequence 5, Appli
38    328          [illegible]  [illegible]  [illegible]  [illegible]          Sequence 5, Appli
39    [illegible]  [illegible]  [illegible]  [illegible]  [illegible]          Sequence 1, Appli
40    328          19.9         355          4            [illegible]          Sequence 14, Appl
41    328          19.9         355          5            PCT-US95-00476-5     Sequence 5, Appli
42    326.5        19.8         360          4            US-08-833-752-10     Sequence 10, Appl
43    325.5        19.7         346          4            US-09-585-876-2      Sequence 2, Appli
44    325          19.7         302          2            US-08-467-948A-30    Sequence 30, Appl
45    325          19.7         302          3            US-08-467-947A-30    Sequence 30, Appl



ALIGNMENTS 



RESULT 1 

US-08-559-524A-2 

; Sequence 2, Application US/08559524A 
; Patent No. 5871963 
; GENERAL INFORMATION: 
; APPLICANT: Conley, Pamela B. 
; APPLICANT: Jantzen, Hans-Michael 
; TITLE OF INVENTION: NOVEL PURINERGIC RECEPTOR 
; NUMBER OF SEQUENCES: 14 
; CORRESPONDENCE ADDRESS: 
; ADDRESSEE: MORGAN, LEWIS & BOCKIUS LLP 
; STREET: 1800 M Street, N.W. 
; CITY: Washington 
; STATE: D.C. 
; COUNTRY: USA 
; ZIP: 20036-5869 
; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/559,524A 
; FILING DATE: 15-NOV-1995 
; CLASSIFICATION: 435 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Adler, Reid G. 
; REGISTRATION NUMBER: 30,988 
; REFERENCE/DOCKET NUMBER: 044481-5010-00-US 
; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 202-467-7000 
; TELEFAX: 202-467-7176 
; INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 334 amino acids 
; TYPE: amino acid 
; TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-08-559-524A-2 

Query Match 74.3%; Score 1226.5; DB 2; Length 334; 

Best Local Similarity 71.5%; Pred. No. 4.7e-93; 

Matches 226; Conservative 42; Mismatches 47; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60 

II I = I = I M I II I I I I I I II III: hill I I : I I : I : I II I I M : I I I I 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

I I * I I I I I I I I I : II : I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I : I I I I : : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

M I M I I I I I I I I I I I I I I I : I I I I I I : I I : I I I | : | : I | : | | | | : | : I 
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMVVFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240 

I I I I : I M M I I I I I I I I I I I I II : : I I I : I : : I I I I II I : M I I : : I I I I I I : I 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

I I I M : I I I : I I I I M II I Ml | | | : M I , I | , I I I : I I | | : | | | : | : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRALGFLNSVINPVFYFLLGDHFRD 304 

Qy 300 MLISKFRQYFKSLTSF 315 

I I : : : I M I I I I I 
Db 305 MLMNQLRHNFKSLTSF 320 
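
The "Best Local Similarity" figures in this report are consistent with the number of identical positions divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels). The sketch below checks that reading against the statistics lines printed here. The relation is inferred from the numbers rather than documented, and `local_similarity` is an illustrative name.

```python
import re


def local_similarity(stats_line):
    """Percent identity over all alignment columns, from a 'Matches ...;' line."""
    counts = {key: int(val) for key, val in re.findall(r"(\w+)\s+(\d+);", stats_line)}
    columns = (counts["Matches"] + counts["Conservative"]
               + counts["Mismatches"] + counts["Indels"])
    return round(100.0 * counts["Matches"] / columns, 1)


line = "Matches 226; Conservative 42; Mismatches 47; Indels 1; Gaps 1;"
print(local_similarity(line))
```

For Result 1 this gives 226 identities over 316 columns, which rounds to the 71.5% shown above.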



RESULT 2 
US-08-749-707-2 

; Sequence 2, Application US/08749707 
; Patent No. 6063582 
; GENERAL INFORMATION: 
; APPLICANT: Conley, Pamela B. 
; APPLICANT: Jantzen, Hans-Michael 
; TITLE OF INVENTION: NOVEL PURINERGIC RECEPTOR 
; NUMBER OF SEQUENCES: 14 
; CORRESPONDENCE ADDRESS: 
; ADDRESSEE: MORGAN, LEWIS & BOCKIUS LLP 
; STREET: 1800 M Street, N.W. 
; CITY: Washington 
; STATE: D.C. 
; COUNTRY: USA 
; ZIP: 20036-5869 
; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/749,707 
; FILING DATE: 15-NOV-1996 
; CLASSIFICATION: 536 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Adler, Reid G. 
; REGISTRATION NUMBER: 30,988 
; REFERENCE/DOCKET NUMBER: 044481-5010-01-US 
; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 202-467-7000 
; TELEFAX: 202-467-7176 
; INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 334 amino acids 
; TYPE: amino acid 
; TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-08-749-707-2 



Query Match 74.3%; Score 1226.5; DB 3; Length 334; 

Best Local Similarity 71.5%; Pred. No. 4.7e-93; 

Matches 226; Conservative 42; Mismatches 47; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60 

M I : I : I I M II I I I I II II III: I : I II II : I I : I : I II I I I I : I I I I 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

M : M I I M I I I : I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I : : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 



Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

I I I I I I I I I I I I I M I I I I I : I I I I I I : I I : I II I : | : I | : | | | | : | : | 
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMVVFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240 

I I I I : I II I I I I I I I I II I I I I I I : : II I : I : : I I I I I I I : I I I I :: I I I I I I : I 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244 



Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

I I I I I : I I I : I I I I I I I I I I * M I I I M I I I : I I I I : I I I : I : 

Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRALGFLNSVINPVFYFLLGDHFRD 304 



Qy 300 MLISKFRQYFKSLTSF 315 

I I : : : I I I II I I I 
Db 305 MLMNQLRHNFKSLTSF 320 
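
The "Matches" count in each statistics line is the number of aligned columns where query and database residues are identical. As a small sketch of that counting (the function name is an illustrative assumption), the final alignment block above can be checked directly:

```python
def count_identities(qy, db):
    """Count columns where both aligned rows hold the same residue."""
    if len(qy) != len(db):
        raise ValueError("aligned rows must have the same number of columns")
    return sum(1 for q, d in zip(qy, db) if q == d and q != "-")


# Last block of the alignment above (Qy 300-315 against Db 305-320).
print(count_identities("MLISKFRQYFKSLTSF", "MLMNQLRHNFKSLTSF"))
```

Summing this count over every block of one alignment should reproduce the "Matches" value printed for that result, provided the sequence rows are free of transcription errors.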



RESULT 3 
US-09-947-922-2 

; Sequence 2, Application US/09947922 
; Patent No. 6680373 
; GENERAL INFORMATION: 
; APPLICANT: Conley, Pamela B. 
; Jantzen, Hans-Michael 
; TITLE OF INVENTION: NOVEL PURINERGIC RECEPTOR 
; NUMBER OF SEQUENCES: 14 
; CORRESPONDENCE ADDRESS: 
; ADDRESSEE: MORGAN, LEWIS & BOCKIUS LLP 
; STREET: 1800 M Street, N.W. 
; CITY: Washington 
; STATE: D.C. 
; COUNTRY: USA 
; ZIP: 20036-5869 
; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/09/947,922 
; FILING DATE: 07-Sep-2001 
; CLASSIFICATION: <Unknown> 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: US/08/749,707 
; FILING DATE: 15-NOV-1996 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Adler, Reid G. 
; REGISTRATION NUMBER: 30,988 
; REFERENCE/DOCKET NUMBER: 044481-5010-01-US 
; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 202-467-7000 
; TELEFAX: 202-467-7176 
; INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 334 amino acids 
; TYPE: amino acid 
; TOPOLOGY: linear 
; MOLECULE TYPE: protein 
; SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
US-09-947-922-2 

Query Match 74.3%; Score 1226.5; DB 4; Length 334; 

Best Local Similarity 71.5%; Pred. No. 4.7e-93; 

Matches 226; Conservative 42; Mismatches 47; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60 

II I : I : M II II I I I I II II III: hill I I : I I : | : I I I I I I I : I I I I 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64 



Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

IIHI M I I I I I : I I : I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I : : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

I I I I I I I II I I I I I I I I II I : I I I I II : I I : I II I : I : I I : I I I I : | : I 
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMVVFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240 

I I I I : I I I I I I I I I I I I I I I I I I I : : | M : I : : I I I I I I I : I I I I :: I I I I I I : I 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

I I I M : I II : I I I I II I I I III I I I : I I I I I I I I I I : | | | | : I I I : I : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRALGFLNSVINPVFYFLLGDHFRD 304 

Qy 300 MLISKFRQYFKSLTSF 315 

I I : : : I I I f I I I I 
Db 305 MLMNQLRHNFKSLTSF 320 



RESULT 4 

US-08-559-524A-4 

; Sequence 4, Application US/08559524A 
; Patent No. 5871963 
; GENERAL INFORMATION: 
; APPLICANT: Conley, Pamela B. 
; APPLICANT: Jantzen, Hans-Michael 
; TITLE OF INVENTION: NOVEL PURINERGIC RECEPTOR 
; NUMBER OF SEQUENCES: 14 
; CORRESPONDENCE ADDRESS: 
; ADDRESSEE: MORGAN, LEWIS & BOCKIUS LLP 
; STREET: 1800 M Street, N.W. 
; CITY: Washington 
; STATE: D.C. 
; COUNTRY: USA 
; ZIP: 20036-5869 
; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/559,524A 
; FILING DATE: 15-NOV-1995 
; CLASSIFICATION: 435 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Adler, Reid G. 
; REGISTRATION NUMBER: 30,988 
; REFERENCE/DOCKET NUMBER: 044481-5010-00-US 
; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 202-467-7000 
; TELEFAX: 202-467-7176 
; INFORMATION FOR SEQ ID NO: 4: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 373 amino acids 
; TYPE: amino acid 
; STRANDEDNESS: 
; TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-08-559-524A-4 

Query Match 28.7%; Score 474; DB 2; Length 373; 

Best Local Similarity 36.6%; Pred. No. 2.3e-31; 



Matches 105; Conservative 58; Mismatches 116; Indels 8; Gaps 6; 

Qy 20 YYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 
11111:111111 : : : : | | | | : : I | : | | | : : : | | : : | || | | | 
Db 52 YYLPAVYILVFIIGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111 

Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138 
1 : 1 1 : 1 1 : : 1 1 1 I 1 1 1 1 II II | | : | | : | | : | : | 
Db 112 FNKTDWIFGDAMCKLQRFIFHVNLYGSILFLTCISAHRYSGVVYPLKSLGRLKKKNAVYI 171 

Qy 139 SLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 
1 : 1 1 : 1 : : 1 : 1 : : : 1 : II 1 1 II : 1 1 : 1 : 1 I 
Db 172 SVLVWLIVVVGISPILFYSGTGIRKNKTITCYDTTSDEYLRSYFIYSMCTTVAMFCVPLV 231 

Qy 198 VMCFFYYKMVVFLKRRSQQQATALPL-DKPQRLVVLAVVIFSILFTPYHIMRNLRIASRL 256 
= : 1 : 1 1 : : II 1 I I : : : : | : : : | : | : | : : : : | | 
Db 232 LILGCYGLIVRALIYKDLDNS---PLRRKSIYLVIIVLTVFAVSYIPFHVMKTMNLRARL 288 

Qy 257 D-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 301 
1 1:1 : : 1 : 1 1 1 1 1 II : : 1 1 1 1 1 1 1 : 1 I 
Db 289 DFQTPEMCAFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 335 





RESULT 5 
US-08-749-707-4 

; Sequence 4, Application US/08749707 
; Patent No. 6063582 
; GENERAL INFORMATION: 
; APPLICANT: Conley, Pamela B. 
; APPLICANT: Jantzen, Hans-Michael 
; TITLE OF INVENTION: NOVEL PURINERGIC RECEPTOR 
; NUMBER OF SEQUENCES: 14 
; CORRESPONDENCE ADDRESS: 
; ADDRESSEE: MORGAN, LEWIS & BOCKIUS LLP 
; STREET: 1800 M Street, N.W. 
; CITY: Washington 
; STATE: D.C. 
; COUNTRY: USA 
; ZIP: 20036-5869 
; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/749,707 
; FILING DATE: 15-NOV-1996 
; CLASSIFICATION: 536 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Adler, Reid G. 
; REGISTRATION NUMBER: 30,988 
; REFERENCE/DOCKET NUMBER: 044481-5010-01-US 
; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 202-467-7000 
; TELEFAX: 202-467-7176 
; INFORMATION FOR SEQ ID NO: 4: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 373 amino acids 
; TYPE: amino acid 
; STRANDEDNESS: 
; TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-08-749-707-4 

Query Match 28.7%; Score 474; DB 3; Length 373; 

Best Local Similarity 36.6%; Pred. No. 2.3e-31; 

Matches 105; Conservative 58; Mismatches 116; Indels 8; Gaps 6; 

Qy 20 YYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 

11111:111111 :: ::| || |: : | | : | || : : : | | : : | | | | | | 
Db 52 YYLPAVYILVFIIGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111 

Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138 

I : I I : I I :: I I I I I I I I I I I I II : I I : I I : I : I 

Db 112 FNKTDWIFGDAMCKLQRFIFHVNLYGSILFLTCISAHRYSGVVYPLKSLGRLKKKNAVYI 171 

Qy 139 SLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

I : I I : I : : I : I : : : I : I I I I I I : I I : I : I | 

Db 172 SVLVWLIVVVGISPILFYSGTGIRKNKTITCYDTTSDEYLRSYFIYSMCTTVAMFCVPLV 231 

Qy 198 VMCFFYYKMVVFLKRRSQQQATALPL-DKPQRLVVLAVVIFSILFTPYHIMRNLRIASRL 256 

: : I : I I : : I I I I I : : : : I : : : I : I : I : : : : | | 
Db 232 LILGCYGLIVRALIYKDLDNS---PLRRKSIYLVIIVLTVFAVSYIPFHVMKTMNLRARL 288 

Qy 257 D-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 301 

I I : I : : I : I I I I I M : : I I I I I I I : I I 

Db 289 DFQTPEMCAFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 335 



RESULT 6 
US-09-947-922-4 

; Sequence 4, Application US/09947922 
; Patent No. 6680373 
; GENERAL INFORMATION: 
; APPLICANT: Conley, Pamela B. 
; Jantzen, Hans-Michael 
; TITLE OF INVENTION: NOVEL PURINERGIC RECEPTOR 
; NUMBER OF SEQUENCES: 14 
; CORRESPONDENCE ADDRESS: 
; ADDRESSEE: MORGAN, LEWIS & BOCKIUS LLP 
; STREET: 1800 M Street, N.W. 
; CITY: Washington 
; STATE: D.C. 
; COUNTRY: USA 
; ZIP: 20036-5869 
; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/09/947,922 
; FILING DATE: 07-Sep-2001 
; CLASSIFICATION: <Unknown> 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: US/08/749,707 
; FILING DATE: 15-NOV-1996 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Adler, Reid G. 
; REGISTRATION NUMBER: 30,988 
; REFERENCE/DOCKET NUMBER: 044481-5010-01-US 
; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 202-467-7000 
; TELEFAX: 202-467-7176 
; INFORMATION FOR SEQ ID NO: 4: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 373 amino acids 
; TYPE: amino acid 
; STRANDEDNESS: <Unknown> 
; TOPOLOGY: linear 
; MOLECULE TYPE: protein 
; SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
US-09-947-922-4 

Query Match 28.7%; Score 474; DB 4; Length 373; 

Best Local Similarity 36.6%; Pred. No. 2.3e-31; 

Matches 105; Conservative 58; Mismatches 116; Indels 8; Gaps 6; 

Qy 20 YYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 

11111:111111 :: ::| || |: :||:|||:::|| :: Ml II I 
Db 52 YYLPAVYILVFIIGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111 

Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138 

I : I I = I I : : I I I I I I I I I I II II : I I : I I : I : I 

Db 112 FNKTDWIFGDAMCKLQRFIFHVNLYGSILFLTCISAHRYSGVVYPLKSLGRLKKKNAVYI 171 

Qy 139 SLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

I : I I : I : : I : I : : : I : II I I I I : I I : I : I I 

Db 172 SVLVWLIVVVGISPILFYSGTGIRKNKTITCYDTTSDEYLRSYFIYSMCTTVAMFCVPLV 231 

Qy 198 VMCFFYYKMVVFLKRRSQQQATALPL-DKPQRLVVLAVVIFSILFTPYHIMRNLRIASRL 256 

I :| I : : II I I I : : : : I : : : | : I : I : : : : || 
Db 232 LILGCYGLIVRALIYKDLDNS---PLRRKSIYLVIIVLTVFAVSYIPFHVMKTMNLRARL 288 

Qy 257 D-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 301 

I hi : : I : I I I I I I I : : I I I I I I I : I I 

Db 289 DFQTPEMCAFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 335 
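
Each alignment row carries its own consistency check: the number of residues printed (gap characters excluded) must equal end position minus start position plus one. The sketch below, with an illustrative function name, applies that check. It is a convenient way to spot transcription damage such as a "VV" pair that has been read as a single "W".

```python
def residues_match_coordinates(start, aligned, end):
    """True if the residues in an alignment row fill exactly start..end."""
    residues = sum(1 for ch in aligned if ch.isalpha())
    return residues == end - start + 1


# Last query row of the alignment above: positions 257-301, two gap columns.
row = "D-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML"
print(residues_match_coordinates(257, row, 301))
```

A row that fails the check has lost or gained residues somewhere between its two coordinates.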



RESULT 7 

US-08-513-974B-374 

; Sequence 374, Application US/08513974B 
; Patent No. 6114139 
; GENERAL INFORMATION: 
; APPLICANT: Hinuma, Shuji 
; APPLICANT: Hosoya, Masaki 
; APPLICANT: Fujii, Ryo 
; APPLICANT: Ohtaki, Tetsuya 
; APPLICANT: Fukusumi, Shoji 
; APPLICANT: Ohgi, Kazuhiro 
; TITLE OF INVENTION: G PROTEIN COUPLED RECEPTOR PROTEIN, 
; TITLE OF INVENTION: PRODUCTION, AND USE THEREOF 
; NUMBER OF SEQUENCES: 380 
; CORRESPONDENCE ADDRESS: 
; ADDRESSEE: DIKE, BRONSTEIN, ROBERTS & CUSHMAN, LLP 
; STREET: 130 Water Street 
; CITY: Boston 
; STATE: MA 
; COUNTRY: USA 
; ZIP: 02109 
; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/513,974B 
; FILING DATE: 14-SEP-1995 
; CLASSIFICATION: 536 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: PCT/JP95/01599 
; FILING DATE: 10-AUG-1995 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 7-093989 
; FILING DATE: 19-AUG-1995 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 7-057186 
; FILING DATE: 16-MAR-1995 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 7-007177 
; FILING DATE: 20-JAN-1995 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-326611 
; FILING DATE: 28-DEC-1994 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-270017 
; FILING DATE: 02-NOV-1994 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-236357 
; FILING DATE: 30-SEP-1994 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-236356 
; FILING DATE: 30-SEP-1994 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-189274 
; FILING DATE: 11-AUG-1994 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-189273 
; FILING DATE: 11-AUG-1945 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-189272 
; FILING DATE: 11-AUG-1994 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Resnick, David S. 
; REGISTRATION NUMBER: 34,235 
; REFERENCE/DOCKET NUMBER: 45753 
; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 617-523-3400 
; TELEFAX: 617-523-6440 
; INFORMATION FOR SEQ ID NO: 374: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 362 amino acids 
; TYPE: amino acid 
; STRANDEDNESS: 
; TOPOLOGY: linear 
; MOLECULE TYPE: peptide 
US-08-513-974B-374 

Query Match 28.7%; Score 473.5; DB 3; Length 362; 

Best Local Similarity 35.9%; Pred. No. 2.4e-31; 

Matches 110; Conservative 57; Mismatches 122; Indels 17; Gaps 8; 

Qy 10 WLA TEAILNK YYLSAFYAI EFI FGLLGN VTVVFGYLFCMKNWNS SNVYLFN 60 

II I : M III I : I I I I I I : : :: | | : | : : | | : | | 

Db 22 WAAGNATTKCSLTKTGFQFYYLPTVYILVFITGFLGNSVAIWMFVFHMRPWSGISVYMFN 81 

Qy 61 LSISDFAFLCTLPILIKSYANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLL 119 

I : M : : I I I II II : I I I : I I :: I I I I I I II I I I I : I I 

Db 82 LALADFLYVLTLPALIFYYFNKTDWIFGDVMCKLQRFIFHVNLYGSILFLTCISVHRYTG 141 

Qy 120 MKYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFIN-SVPKEEGSNCIDYASSGNPE 178 

: : I : II : I : : I I I I I I : I : I : I : : | | : 

Db 142 VA/HPLKSLGRLKKKNAVYVSSLVWALWAVIAPILFYSGIGVRRNKTITCYDTTADEYLR 201 

Qy 179 HNLIYSLCLTLLGFLIPLSVMCFFYYKMVVFLKRRSQQQATALPL-DKPQRLVVLAVVIF 237 

: M : I I : III I : I : I I : : | | | | | : : : : | 
Db 202 SYFVYSMCTTVFMFCIPFIVILGCYGLIVKALIYKDLDNS---PLRRKSTYLVIIVLTVF 258 

Qy 238 SILFTPYHIMRNLRIASRLD-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGD 295 

: : : I : I : I : I : : I I I II I : : I : I I I I I | | : : I I I I I I I 

Db 259 AVSYLPFHVMKTLNLRARLDFQTPQMCAFNDKVYATYQVTRGLASLNSCVDPILYFLAGD 318 

Qy 296 HYREML 301 

: I I 

Db 319 TFRRRL 324 



RESULT 8 

US-09-102-710B-3 

; Sequence 3, Application US/09102710B 
; Patent No. 6479630 
; GENERAL INFORMATION: 
; APPLICANT: Coleman, Roger 
; APPLICANT: Au-Young, Janice 
; APPLICANT: Stuart, Susan G. 
; TITLE OF INVENTION: A NOVEL HUMAN PURINERGIC P2U RECEPTOR 
; FILE REFERENCE: PF-0038-1 DIV 
; CURRENT APPLICATION NUMBER: US/09/102,710B 
; CURRENT FILING DATE: 1998-06-22 
; NUMBER OF SEQ ID NOS: 3 
; SOFTWARE: PERL Program 
; SEQ ID NO 3 
; LENGTH: 374 
; TYPE: PRT 
; ORGANISM: Rattus norvegicus 
; FEATURE: 
; NAME/KEY: misc_feature 
; OTHER INFORMATION: RNU09402 
US-09-102-710B-3 

Query Match 22.5%; Score 372; DB 4; Length 374; 

Best Local Similarity 33.8%; Pred. No. 5.1e-23; 

Matches 96; Conservative 49; Mismatches 125; Indels 14; Gaps 7; 

Qy 19 KY-YLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIK 77 

M I I : : I I II : : : I : I I I : I I : I : I : : I I : : I I : I : 
Db 32 KYVLLPVSYGVVCVLGLCLNVVALYIFLCRLKTWNASTTYMFHLAVSDSLYAASLPLLVY 91 

Qy 78 SYA-NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAI 136 

II I : III I : : : I I I I II I II I I I : I I : I : | 

Db 92 YYAQGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCISVHRSLGVLRPLHSLRWGHARYAR 151 

Qy 137 LISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPL 196 

:: I I I I I : | | : : : | | : I : I I : I I : I 

Db 152 RVAAVVWVLVLACQTPVLYFVTTS WGTRITCHDTSDRELFSHFVAYS 211 

Qy 197 SVMCFFYYKMWFLKRRSQQQA TALPLDKPQ — RLWLAWI FS I L FT P YH I MRN L R 251 

I : : I I : I I : I III I : I : I : : I : : I I : I : I I 

Db 212 SIILVCY- VLMARRLLKPAYGTTGLPRAKRKSVRTIALVLAVFALCFLPFHVTRTLY 267 

Qy 252 IASRLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMG 294 

: I I II II I : I I I I I I I : : I : | M | 

Db 268 YSFR--SLDLSCHTLNAINMAYKITRPLASANSCLDPVLYFLAG 309 



RESULT 9 

US-08-513-974B-373 

; Sequence 373, Application US/08513974B 
; Patent No. 6114139 
; GENERAL INFORMATION: 
; APPLICANT: Hinuma, Shuji 
; APPLICANT: Hosoya, Masaki 
; APPLICANT: Fujii, Ryo 
; APPLICANT: Ohtaki, Tetsuya 
; APPLICANT: Fukusumi, Shoji 
; APPLICANT: Ohgi, Kazuhiro 
; TITLE OF INVENTION: G PROTEIN COUPLED RECEPTOR PROTEIN, 
; TITLE OF INVENTION: PRODUCTION, AND USE THEREOF 
; NUMBER OF SEQUENCES: 380 
; CORRESPONDENCE ADDRESS: 
; ADDRESSEE: DIKE, BRONSTEIN, ROBERTS & CUSHMAN, LLP 
; STREET: 130 Water Street 
; CITY: Boston 
; STATE: MA 
; COUNTRY: USA 
; ZIP: 02109 
; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/513,974B 
; FILING DATE: 14-SEP-1995 
; CLASSIFICATION: 536 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: PCT/JP95/01599 
; FILING DATE: 10-AUG-1995 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 7-093989 
; FILING DATE: 19-AUG-1995 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 7-057186 
; FILING DATE: 16-MAR-1995 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 7-007177 
; FILING DATE: 20-JAN-1995 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-326611 
; FILING DATE: 28-DEC-1994 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-270017 
; FILING DATE: 02-NOV-1994 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-236357 
; FILING DATE: 30-SEP-1994 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-236356 
; FILING DATE: 30-SEP-1994 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-189274 
; FILING DATE: 11-AUG-1994 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-189273 
; FILING DATE: 11-AUG-1945 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 6-189272 
; FILING DATE: 11-AUG-1994 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Resnick, David S. 
; REGISTRATION NUMBER: 34,235 
; REFERENCE/DOCKET NUMBER: 45753 
; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 617-523-3400 
; TELEFAX: 617-523-6440 
; INFORMATION FOR SEQ ID NO: 373: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 373 amino acids 
; TYPE: amino acid 
; STRANDEDNESS: 
; TOPOLOGY: linear 
; MOLECULE TYPE: peptide 
US-08-513-974B-373 



Query Match 22.5%; Score 370.5; DB 3; Length 373; 

Best Local Similarity 33.5%; Pred. No. 6.8e-23; 

Matches 94; Conservative 50; Mismatches 130; Indels 7; Gaps 5; 



Qy 19 KY-YLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIK 77 
II 1 1 : : 1 I II : : : 1 : 1 1 1 : 1 i : 1 : 1 : : 1 I : : I I : | : 
Db 32 KYVLLPVSYGVVCVLGLCLNVVALYIFLCRLKIWNASTTYMFHLAVSDSLYAASLPLLVY 91 

Qy 78 SYA-NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAI 136 
II 1 ■ III 1 : : : 1 1 1 1 1 II 1 1 1 1 1 : 1 1 : I : : | 
Db 92 YYARGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCISVHRCLGVLRPLHSLRWGRARYAR 151 

Qy 137 LISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPL 196 
: : 1 1 1 1 | : | | : : : 1 1 :: 1 : 1 1 : 1 1 : I 
Db 152 RVAAVVWVLVLACQAPVLYFVTTSVRGTRITCHDTSARELFSHFVAYSSVMLGLLFAVPF 211 

Qy 197 SVMCFFYYKMVVFLKRRSQQQATALPLDKPQ--RLVVLAVVIFSILFTPYHIMRNLRIAS 254 
II: 1 1 1 : : II 1 : 1 : 1 : : 1 : : 1 I : I : | | : 
Db 212 SVILVCYVLMARRLLKPAYGTTGGLPRAKRKSVRTIALVLAVFALCFLPFHVTRTLYYSF 271 

Qy 255 RLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMG 294 
1 1 II II 1 : 1 1 II 1 1 1 :: 1 : 1 1 1 1 
Db 272 R--SLDLSCHTLNAINMAYKITRPLASANSCLDPVLYFLAG 310 





RESULT 10 
US-08-153-848-28 

; Sequence 28, Application US/08153848 
; Patent No. 5759804 
; GENERAL INFORMATION: 
; APPLICANT: Godiska, Ronald 
; APPLICANT: Gray, Patrick W. 
; APPLICANT: Schweikart, Vicki L. 
; TITLE OF INVENTION: Novel Seven Transmembrane Receptors 
; NUMBER OF SEQUENCES: 64 
; CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & 
; ADDRESSEE: Bicknell 
; STREET: 6300 Sears Tower, 233 South Wacker Drive 
; CITY: Chicago 
; STATE: Illinois 
; COUNTRY: USA 
; ZIP: 60606 
; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.25 
; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/153,848 
; FILING DATE: 
; CLASSIFICATION: 514 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: US 07/977,452 
; FILING DATE: 17-NOV-1992 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Noland, Greta E. 
; REGISTRATION NUMBER: 35,302 
; REFERENCE/DOCKET NUMBER: 31794 
; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (312) 474-6300 
; TELEFAX: (312) 474-0448 
; TELEX: 25-3856 
; INFORMATION FOR SEQ ID NO: 28: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 355 amino acids 
; TYPE: amino acid 
; TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-08-153-848-28 

Query Match 21.4%; Score 353; DB 1; Length 355; 

Best Local Similarity 28.1%; Pred. No. 1.7e-21; 

Matches 84; Conservative 66; Mismatches 133; Indels 16; Gaps 6; 

Qy 16 ILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPIL 75 

: :li II:: I ||:||: III I : :::|| ||::|| |: ||| 

Db 28 VFGTVFLSIFYSVIFAIGLVGNLLVVFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 87 

Qy 76 IKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFA 135 

I = I I : : : I : I I I : I I I : I I I I : : : 

Db 88 THYLINEKGLH-NAMCKFTTAFFFIGFFGSIFFITVISIDRYLAIVLAANSMNNRTVQHG 146 

Qy 136 ILISLAVWALVTLEVLPMLTFINSVPKEEGSNCI-DYASSGNPEHNLIYSLCLTLLGFLI 194 

: I I I I I I I I I | : : : | : | | : : : : | | | | : 

Db 147 VTISLGVWAAAILVAAPQFMF----TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202 

Qy 195 PLSVMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMRNLRIA 253 

I I : I : I : : : : : : : : I I : : I : : | | : : | : : | | | : : | | 

Db 203 PLLIMSYCYFRIIQTLFSCKNHKKAKAI------KLILLVVIVFFLFWTPYNVMIFLETL 256 

Qy 254 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312 

I : i : I : : : : I : I I : : I I : I I : : I I : III 
Db 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKFRRYL---YHLYGKCL 312 



RESULT 11 
US-08-153-848-32 

; Sequence 32, Application US/08153848 
; Patent No. 5759804 

GENERAL INFORMATION: 

APPLICANT: Godiska, Ronald 
; APPLICANT: Gray, Patrick W. 
; APPLICANT: Schweikart, Vicki L. 

TITLE OF INVENTION: No. 5759804el Seven Transmembrane Receptors 
NUMBER OF SEQUENCES: 64 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Marshall, O' Toole, Gerstein, Murray & 

; ADDRESSEE : Bicknell 

STREET: 6300 Sears Tower, 233 South Wacker Drive 
CITY: Chicago 



STATE: Illinois 

COUNTRY: USA 

ZIP: 60606 
; COMPUTER READABLE FORM: 

MEDIUM TYPE : Floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/153, 848 

FILING DATE: 

CLASSIFICATION: 514 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/977,452 

FILING DATE: 17-NOV-1992 
ATTORNEY/AGENT INFORMATION: 
; NAME: Noland, Greta E.

; REGISTRATION NUMBER: 35,302

REFERENCE/DOCKET NUMBER: 31794
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (312) 474-6300 

TELEFAX: (312) 474-0448 

TELEX: 25-3856 
; INFORMATION FOR SEQ ID NO: 32: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 355 amino acids 

TYPE: amino acid 
; TOPOLOGY: linear 

; MOLECULE TYPE: protein 
US-08-153-848-32 

Query Match 21.4%; Score 353; DB 1; Length 355; 

Best Local Similarity 28.1%; Pred. No. 1.7e-21; 

Matches 84; Conservative 66; Mismatches 133; Indels 16; Gaps 6; 

Qy 16 ILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPIL 75

: : II I I :: I I I : I I : I I I I ■ : : : : I I I I : : | | | : | | | 

Db 28 VFGTVFLSIFYSVIFAIGLVGNLLVVFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 87

Qy 76 IKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFA 135 

1 : I I : : : I : I I I : I I I : I I I I : : : 

Db 88 THYLINEKGLH-NAMCKFTTAFFFIGFFGSIFFITVISIDRYLAIVLAANSMNNRTVQHG 146 

Qy 136 ILISLAVWALVTLEVLPMLTFINSVPKEEGSNCI-DYASSGNPEHNLIYSLCLTLLGFLI 194 

: I I I I I I I I I | : : : | : | | : : : : I II I : 

Db 147 VTISLGVWAAAILVAAPQFMF----TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202

Qy 195 PLSVMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMRNLRIA 253

I I : I : I : : : : : : : : I I : : I : : | I : : I : : | | | : : | | 

Db 203 PLLIMSYCYFRIIQTLFSCKNHKKAKAI------KLILLVVIVFFLFWTPYNVMIFLETL 256

Qy 254 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312

I : i : I : : : : I : I I : : I I : I I : : I I : III 
Db 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKFRRYL---YHLYGKCL 312



RESULT 12 



US-09-299-843A-28 

; Sequence 28, Application US/09299843A 
; Patent No. 6107475 

GENERAL INFORMATION: 

APPLICANT: Godiska, Ronald 
; APPLICANT: Gray, Patrick W. 
; APPLICANT: Schweikart, Vicki L. 

TITLE OF INVENTION: Novel Seven Transmembrane Receptors
; NUMBER OF SEQUENCES : 66 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & 
; ADDRESSEE: Borun 

STREET: 6300 Sears Tower, 233 South Wacker Drive 
; CITY: Chicago 

STATE: Illinois 
COUNTRY: USA 
; ZIP: 60606 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/299, 843A 
; FILING DATE: 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 09/088,337 
; FILING DATE: 01-JUN-1998 

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/153,848 
FILING DATE: 17-NOV-1993 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: US 07/977,452 

; FILING DATE: 17-NOV-1992 

ATTORNEY/ AGENT INFORMATION: 
NAME: Jill E. Uhl 
REGISTRATION NUMBER: 43,213 
; REFERENCE/DOCKET NUMBER: 27866/32059B 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (312) 474-6300 
TELEFAX: (312) 474-0448 
TELEX: 

; INFORMATION FOR SEQ ID NO: 28: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 355 amino acids 

; TYPE: amino acid 

; TOPOLOGY: linear 

MOLECULE TYPE: protein 
US-09-299-843A-28 

Query Match 21.4%; Score 353; DB 3; Length 355; 

Best Local Similarity 28.1%; Pred. No. 1.7e-21; 

Matches 84; Conservative 66; Mismatches 133; Indels 16; Gaps 6 

Qy 16 ILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPIL 75

: : I I II:: I II : I I : I I I I : : : : I I I I : : I I I : I I I 



Db 28 VFGTVFLSIFYSVIFAIGLVGNLLVVFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 87



Qy 76 IKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFA 135

I : I I : : : I : I I I : I II : I I I I : : : 

Db 88 THYLINEKGLH-NAMCKFTTAFFFIGFFGSIFFITVISIDRYLAIVLAANSMNNRTVQHG 146

Qy 136 ILISLAVWALVTLEVLPMLTFINSVPKEEGSNCI-DYASSGNPEHNLIYSLCLTLLGFLI 194 

: I I I I I I I I I I : : : | : I | : : : : | | | | : 

Db 147 VTISLGVWAAAILVAAPQFMF----TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202

Qy 195 PLSVMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMRNLRIA 253

I I : 1 : I : : : : : : : : I | : : | : : | | : : | : : | | | : : | | 

Db 203 PLLIMSYCYFRIIQTLFSCKNHKKAKAI------KLILLVVIVFFLFWTPYNVMIFLETL 256

Qy 254 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312

I : I : I : : : : I : I I : : I I : I I : : I I : | | | 
Db 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKFRRYL---YHLYGKCL 312



RESULT 13 
US-09-299-843A-32 

; Sequence 32, Application US/09299843A 

; Patent No. 6107475 

; GENERAL INFORMATION: 

APPLICANT: Godiska, Ronald 
; APPLICANT: Gray, Patrick W. 

APPLICANT: Schweikart, Vicki L. 

TITLE OF INVENTION: Novel Seven Transmembrane Receptors
; NUMBER OF SEQUENCES: 66 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: Marshall, O 'Toole, Gerstein, Murray & 
; ADDRESSEE: Borun 

; STREET: 6300 Sears Tower, 233 South Wacker Drive 

CITY: Chicago 
STATE: Illinois 
COUNTRY: USA 
ZIP: 60606 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/299, 843A 

FILING DATE: 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 09/088,337 

FILING DATE: 01-JUN-1998 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/153,848 

FILING DATE: 17-NOV-1993 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/977,452 

FILING DATE: 17-NOV-1992
ATTORNEY/AGENT INFORMATION: 
; NAME: Jill E. Uhl 



REGISTRATION NUMBER: 43,213 
; REFERENCE/ DOCKET NUMBER: 27866/32059B 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: (312) 474-6300 
TELEFAX: (312) 474-0448 
TELEX: 

; INFORMATION FOR SEQ ID NO: 32: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 355 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-09-299-843A-32 

Query Match 21.4%; Score 353; DB 3; Length 355;

Best Local Similarity 28.1%; Pred. No. 1.7e-21; 

Matches 84; Conservative 66; Mismatches 133; Indels 16; Gaps 6; 

Qy 16 ILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPIL 75

:M II:: I ll:||: III I : :::|| ||::|| |: III 

Db 28 VFGTVFLSIFYSVIFAIGLVGNLLVVFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 87

Qy 76 IKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFA 135

I : I I : : : I : I I I : I I I : I I I I : : : 

Db 88 THYLINEKGLH-NAMCKFTTAFFFIGFFGSIFFITVISIDRYLAIVLAANSMNNRTVQHG 146

Qy 136 ILISLAVWALVTLEVLPMLTFINSVPKEEGSNCI-DYASSGNPEHNLIYSLCLTLLGFLI 194

: I I I I I I I I I | : : : | : | | : : : : MM: 

Db 147 VTISLGVWAAAILVAAPQFMF----TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202

Qy 195 PLSVMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMRNLRIA 253

i I : I : I : : : : : : : : I I : : I : : I I : : | : : M I : : I I 

Db 203 PLLIMSYCYFRIIQTLFSCKNHKKAKAI------KLILLVVIVFFLFWTPYNVMIFLETL 256

Qy 254 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312

1:1 :l :: ::| :M : MM I |: :| I : III 
Db 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKFRRYL---YHLYGKCL 312



RESULT 14 
US-09-088-337B-28 

; Sequence 28, Application US/09088337B 
; Patent No. 6348574 
; GENERAL INFORMATION: 
; APPLICANT: Godiska, Ronald 

Gray, Patrick W. 
; Schweikart, Vicki L. 

; TITLE OF INVENTION: Novel Seven Transmembrane Receptors

; NUMBER OF SEQUENCES: 66 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & 
; Borun

STREET: 6300 Sears Tower, 233 South Wacker Drive 

CITY: Chicago 

STATE: Illinois 

COUNTRY: USA 

ZIP: 60606 



COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/088,337B
FILING DATE: 01-Jun-1998
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/153,848 
FILING DATE: 17-NOV-1993 
APPLICATION NUMBER: US 07/977,452 
FILING DATE: 17-NOV-1992 
ATTORNEY/AGENT INFORMATION: 

NAME: Noland, Greta E.
REGISTRATION NUMBER: 35,302 
REFERENCE/ DOCKET NUMBER: 31794 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (312) 474-6300 
TELEFAX: (312) 474-0448 
TELEX: 25-3856 
INFORMATION FOR SEQ ID NO: 28: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 355 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
US-09-088-337B-28 

Query Match 21.4%; Score 353; DB 4; Length 355; 

Best Local Similarity 28.1%; Pred. No. 1.7e-21; 

Matches 84; Conservative 66; Mismatches 133; Indels 16; Gaps 6; 

Qy 16 ILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPIL 75

: :ll II:: I Ihlh III I : :::|| ||::|| |: III 

Db 28 VFGTVFLSIFYSVIFAIGLVGNLLVVFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 87

Qy 76 IKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFA 135 

I : I I : : : I : I I I : I I I : I I I I : : : 

Db 88 THYLINEKGLH-NAMCKFTTAFFFIGFFGSIFFITVISIDRYLAIVLAANSMNNRTVQHG 146

Qy 136 ILISLAVWALVTLEVLPMLTFINSVPKEEGSNCI-DYASSGNPEHNLIYSLCLTLLGFLI 194 

: I I I I I I I I I | : : : | : | | : : : : MM: 

Db 147 VTISLGVWAAAILVAAPQFMF----TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202

Qy 195 PLSVMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMRNLRIA 253

I I : I : I : : : : : : : : | | : : | : : | | : : | : : | M : : I I 

Db 203 PLLIMSYCYFRIIQTLFSCKNHKKAKAI------KLILLVVIVFFLFWTPYNVMIFLETL 256

Qy 254 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312 

1=1 : I : : : : I : II : : I I : I I : : I I : III 
Db 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKFRRYL---YHLYGKCL 312



RESULT 15 



US-09-088-337B-32 

; Sequence 32, Application US/09088337B 
; Patent No. 6348574 

GENERAL INFORMATION: 

APPLICANT: Godiska, Ronald 
; Gray, Patrick W. 

; Schweikart, Vicki L. 

TITLE OF INVENTION: Novel Seven Transmembrane Receptors
NUMBER OF SEQUENCES: 66 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & 
; Borun

STREET: 6300 Sears Tower, 233 South Wacker Drive 
; CITY: Chicago 

; STATE: Illinois 

COUNTRY: USA 
ZIP: 60606 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/088,337B

FILING DATE: 01-Jun-1998 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: US 08/153,848 

FILING DATE: 17-NOV-1993 

APPLICATION NUMBER: US 07/977,452 

FILING DATE: 17-NOV-1992 
ATTORNEY/AGENT INFORMATION: 
; NAME: Noland, Greta E.

REGISTRATION NUMBER: 35,302
; REFERENCE/DOCKET NUMBER: 31794

TELECOMMUNICATION INFORMATION: 

TELEPHONE: (312) 474-6300 

TELEFAX : (312) 474-0448 

TELEX: 25-3856 
; INFORMATION FOR SEQ ID NO: 32: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 355 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
; SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

US-09-088-337B-32 

Query Match 21.4%; Score 353; DB 4; Length 355; 

Best Local Similarity 28.1%; Pred. No. 1.7e-21; 

Matches 84; Conservative 66; Mismatches 133; Indels 16; Gaps 6; 

Qy 16 ILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPIL 75

: : I I I 1 : : I I I : I I : I I I I : : : : I I I I : : I I I : I t I 

Db 28 VFGTVFLSIFYSVIFAIGLVGNLLVVFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 87



Qy 76 IKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFA 135



i-it • • • i • i i i • i i i • i i i i • ; 

Db 88 THYLINEKGLH-NAMCKFTTAFFFIGFFGSIFFITVISIDRYLAIVLAANSMNNRTVQHG 146

Qy 136 ILISLAVWALVTLEVLPMLTFINSVPKEEGSNCI-DYASSGNPEHNLIYSLCLTLLGFLI 194 

: I I I I I I I I I I : : : I : I I : : : : I I I I : 

Db 147 VTISLGVWAAAILVAAPQFMF----TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202

Qy 195 PLSVMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMRNLRIA 253

I I : I : I : : : : : : : : I I : : | : : I I : : | : : | | | : : | | 

Db 203 PLLIMSYCYFRIIQTLFSCKNHKKAKAI------KLILLVVIVFFLFWTPYNVMIFLETL 256

Qy 254 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312 

I : I : I : : : : I : I I : : I I : I I : : I I : III 
Db 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKFRRYL---YHLYGKCL 312



Search completed: August 23, 2004, 17:04:14 
Job time : 33 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on:         August 23, 2004, 16:56:04 ; Search time 40 Seconds
                (without alignments)
                762.318 Million cell updates/sec

Title:          US-09-891-138A-2
Perfect score:  1650
Sequence:       1 MAQNLSCENWLATEAILNKY..........REMLISKFRQYFKSLTSFRT 317

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5



Searched:       283366 seqs, 96191526 residues

Total number of hits satisfying chosen parameters:      283366

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :      PIR_78:*
                pir1:*
                pir2:*
                pir3:*
                pir4:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
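
The percentage figures printed with each result can be cross-checked against the raw counts. The sketch below is an assumption inferred from the numbers in this report, not taken from GenCore documentation: "Query Match" appears to be the score divided by the perfect score (1650 for this query), and "Best Local Similarity" appears to be the number of matches divided by the total number of aligned columns. The function names are hypothetical.

```python
def query_match_pct(score: float, perfect_score: float) -> float:
    """Score expressed as a percentage of the query's perfect score."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches: int, conservative: int,
                              mismatches: int, indels: int) -> float:
    """Identical positions as a percentage of all aligned columns.

    Assumes the aligned length is the sum of the four reported counts.
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Figures for result JC4162 in this report: Score 474, perfect score 1650,
# Matches 105, Conservative 58, Mismatches 116, Indels 8.
print(round(query_match_pct(474, 1650), 1))                # 28.7
print(round(best_local_similarity_pct(105, 58, 116, 8), 1))  # 36.6
```

Both values agree with the "Query Match 28.7%" and "Best Local Similarity 36.6%" lines reported for that result, which is the only evidence offered here for the assumed formulas.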



SUMMARIES 



                    %
Result            Query
   No.   Score    Match  Length  DB  ID        Description

     1     474     28.7     373   2  JC4162    P2Y receptor - bov
     2     474     28.7     373   2  JC4737    G protein-coupled
     3   472.5     28.6     362   2  S33733    G protein-coupled
     4   373.5     22.6     373   2  A47556    ATP receptor P2u -
     5     353     21.4     355   2  JC4304    orphan G protein-c
     6     352     21.3     365   2  S68679    G protein-coupled
     7   336.5     20.4     360   2  A57160    chemokine (C-C) re
     8     336     20.4     328   2  I55450    G protein-coupled
     9   335.5     20.3     375   2  A54946    P-2U nucleotide re
    10     333     20.2     308   2  I50241    G protein-coupled
    11   332.5     20.2     354   2  I58186    probable G protein
    12     332     20.1     328   2  JC4800    P2Y6 receptor - hu
    13     328     19.9     355   2  A45177    chemokine (C-C) re
    14   326.5     19.8     360   2  JC4587    chemokine (C-C) re
    15   [illegible]  [illegible]  [illegible]  2  [illegible]  somatostatin recep
    16   318.5     19.3  [illegible]  2  A48921    interleukin-8 rece
    17   317.5     19.2     355   2  A55733    G protein-coupled
    18   316.5     19.2     384   2  A47249    brain-specific som
    19   315.5     19.1     355   2  JQ1231    interleukin-8 rece
    20     315     19.1     355   2  JC5067    G protein-coupled
    21     314     19.0     369   2  JC2083    somatostatin recep
    22     314     19.0     369   2  B41795    somatostatin recep
    23     314     19.0     370   2  JC5549    heptahelical P2Y5-
    24     313     19.0     344   2  T09508    intron 17 purinerg
    25   312.5     18.9     360   2  A53611    interleukin-8 rece
    26     311     18.8     358   2  A53752    interleukin-8 rece
    27   [illegible]  [illegible]  384   2  JC4629    somatostatin recep
    28     309     18.7     355   2  I49339    macrophage inflamm
    29     308     18.7     369   2  D41795    somatostatin recep
    30   307.5     18.6     369   2  A45291    somatostatin recep
    31     306     18.5     356   2  I49340    MIP-1 alpha recept
    32     306     18.5     362   2  A39714    G protein-coupled
    33     306     18.5     391   2  A39297    somatostatin recep
    34     305     18.5     346   2  S29248    somatostatin recep
    35   304.5     18.5     361   2  B45680    G protein-coupled
    36     303     18.4     350   2  A39445    interleukin-8 rece
    37     303     18.4     391   2  A41795    somatostatin recep
    38     303     18.4     391   2  C41795    somatostatin recep
    39     302     18.3     366   1  OORTB2    bradykinin recepto
    40   301.5     18.3     369   2  JC5068    G protein-coupled
    41     301     18.2     355   2  G02436    chemokine (C-C) re
    42   299.5     18.2     353   2  A53858    bradykinin recepto
    43     298     18.1     354   2  T09353    G protein-coupled
    44     297     18.0     387   2  I69202    G protein-coupled
    45   295.5     17.9     363   2  I57955    somatostatin recep



ALIGNMENTS 



RESULT 1 
JC4162 

P2Y receptor - bovine 

C;Species: Bos primigenius taurus (cattle)
C;Date: 12-Oct-1995 #sequence_revision 10-Nov-1995 #text_change 24-Sep-1999
C;Accession: JC4162
R;Henderson, D.J.; Elliot, D.G.; Smith, G.M.; Webb, T.E.; Dainty, I.A.
Biochem. Biophys. Res. Commun. 212, 648-656, 1995
A;Title: Cloning and characterisation of a bovine P2Y receptor.
A;Reference number: JC4162; MUID:95352058; PMID:7626079
A;Accession: JC4162
A;Molecule type: mRNA
A;Residues: 1-373 <HEN>
A;Cross-references: EMBL:X87628; NID:g1032484; PIDN:CAA60958.1; PID:g1032485
A;Experimental source: aortic endothelial cell
C;Genetics:
A;Gene: bovp2y
C;Superfamily: ATP receptor P2u
C;Keywords: glycoprotein; phosphoprotein; receptor; transmembrane protein
F;52-77/Domain: transmembrane #status predicted <TM1>
F;88-111/Domain: transmembrane #status predicted <TM2>
F;124-150/Domain: transmembrane #status predicted <TM3>
F;171-191/Domain: transmembrane #status predicted <TM4>
F;214-237/Domain: transmembrane #status predicted <TM5>
F;261-282/Domain: transmembrane #status predicted <TM6>
F;305-328/Domain: transmembrane #status predicted <TM7>
F;11,27,113,197/Binding site: carbohydrate (Asn) (covalent) #status predicted
F;258/Binding site: phosphate (Ser) (covalent) (by protein kinase A) #status predicted

Query Match 28.7%; Score 474; DB 2; Length 373; 

Best Local Similarity 36.6%; Pred. No. 4e-32; 

Matches 105; Conservative 58; Mismatches 116; Indels 8; Gaps 6; 

Qy 20 YYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79
11111:111111 : : : : | | | | : : | | : | | | : : : | | : : | | | | | |
Db 52 [sequence not legible]

Qy 80 [sequence not legible]
: I I : I I :: I I I I I I I I I I I I II : I I : I I : I : I
Db 112 [sequence not legible]

Qy 139 [sequence not legible]
I : I I : I : : I : I : : : I : II I I I I : I I : I : I I
Db 172 [sequence not legible]

Qy 198 [sequence not legible]
I I : : II I I I : : : : | : : : | : | : | : : : : | |
Db 232 [start not legible] VRALIYKDLDNS PLRRKSIYLVIIVLTVFAVSYIPFHVMKTMNLRARL 288

Qy 257 [sequence not legible]
I : I I I I I I I :: I I I I I I I : I I
Db 289 [sequence not legible]


RESULT 2 
JC4737 

G protein-coupled receptor P2Y1 - human 

N; Alternate names: P2Y1 purinergic receptor; P2Y1 purinoceptor 
C; Species: Homo sapiens (man) 

C;Date: 10-May-1996 #sequence_revision 16-Aug-1996 #text_change 17-Nov-2000 
C;Accession: JC4737; JC4615; S54253 

R; Janssens, R. ; Communi, D.; Pirotton, S.; Samson, M. ; Parmentier, M. ; 
Boeynaems, J.M. 

Biochem. Biophys. Res. Commun. 221, 588-593, 1996 

A; Title: Cloning and tissue distribution of the human P2Y1 receptor. 
A; Reference number: JC4737; MUID : 96205320 ; PMID: 8630005 
A; Accession: JC4737 
A;Molecule type: DNA 
A; Residues: 1-373 <JAN> 

A;Cross-references: GB:S81950; NID:g1839438; PIDN:AAB47091.1; PID:g1839439
R;Ayyanathan, K. ; Webbs, T.E.; Sandhu, A.K.; Athwal, R.S.; Barnard, E.A. ; 
Kunapuli, S.P. 

Biochem. Biophys. Res. Commun. 218, 783-788, 1996 

A; Title: Cloning and chromosomal localization of the human P2Y1 purinoceptor. 
A;Reference number: JC4615; MUID:96158962; PMID:8579591
A;Accession: JC4615 



A; Molecule type: mRNA 
A; Residues: 1-373 <AYY> 

A;Cross-references : GB:U42029; NID: gll47730; PIDN : AAA97872 . 1 ; PID:gll47731 

A; Experimental source: erythro leukemia cells 

R;Leon, C; Vial, C; Cazenave, J.; Gachet, C. 

submitted to the EMBL Data Library, May 1995 

A; Description : Cloning of a human putative P2Y receptor. 

A; Reference number: S54253 

A;Accession: S54253

A; Status : preliminary 

A;Molecule type: mRNA 

A; Residues: 1-137,139-373 <LEO> 

A; Cross-references: EMBL:Z49205; NID:g798835; PIDN : CAA8 9066 . 1 ; PID:g798836 

C; Comment: This receptor belongs to a family of G protein-coupled receptors. It 

responds to both ADP and ATP, and has several serine/threonine phosphorylation 

residues in the carboxyl terminus. 

C; Genetics : 

A; Gene: p2Yl; GDB : P2RY1 

A/Cross-references: GDB: 677125; OMIM: 601167 

A;Map position: 3pter-3qter 

C; Super family : ATP receptor P2u 

C; Keywords: G protein-coupled receptor; glycoprotein; phosphoprotein; 
transmembrane protein 

F; 52-77/Domain : transmembrane #status predicted <TM1> 
F; 88-111/Domain: transmembrane #status predicted <TM2> 
F; 124-152/Domain: transmembrane #status predicted <TM3> 
F; 171-191/Domain : transmembrane #status predicted <TM4> 
F; 214-237/Domain : transmembrane #status predicted <TM5> 
F; 261-282/Domain : transmembrane #status predicted <TM6> 
F; 305-328/Domain : transmembrane #status predicted <TM7> 

F; 11, 27, 113, 197/Binding site: carbohydrate (Asn) (covalent) #status predicted 
F; 258 , 336/Binding site: phosphate (Ser) (covalent) (by protein kinase A) #status 
predicted 

F; 330, 339/Binding site: phosphate (Thr) (covalent) (by protein kinase C) #status 
predicted 

F; 343/Binding site: phosphate (Ser) (covalent) (by protein kinase C and 
calmodulin-dependent kinase) #status predicted 

Query Match 28.7%; Score 474; DB 2; Length 373; 

Best Local Similarity 37.3%; Pred. No. 4e-32; 

Matches 107; Conservative 55; Mismatches 117; Indels 8; Gaps 6; 

Qy 20 YYLSAFYAI EFI FGLLGNVTVVFGYLFCMKNWNS SNVYLFNLS I S DFAFLCTLPI LI KS Y 79 

I I I I I : I I I I I I : : : : | | | | : : I I : I I I : : : I I : : I I I I I I 
Db 52 YYLPAVYILVFI I GFLGNSVAIWMFVFHMKPWSGI SVYMFNLALADFLYVLTLPALI FYY 111 



Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138 

I : I I : I I :: I I I I I I I I I I I I II : I I : I I : I I I 

Db 112 FNKTDWI FGDAMCKLQRFI FHVNLYGSI LFLTCI SAHRYSGWYPLKSLGRLKKKNAICI 171 



Qy 139 SLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

I : I I : I : : I : 1 : : I I : II I I I I : I I : I : I I 

Db 172 SVLVWLI VWAI S P I LFYS GTGVRKNKT ITC YDTT S DEYLRS YFI YSMCTTVAMFCVPLV 231 



Qy 198 WICFFYYKMWFLKRRSQQQATALPL-DKPQRLVVIAWIFSILFTPYHIMRNLRIASRL 256 

: : I : I I : : II I I I : : : : I : : : I : I : I : : : : I I 
Db 232 LILGCYGLIVRALIYKDLDNS P L RRKS I YLVI I VLT VFAVS YIP FH VMKTMN LRARL 288 



Qy 257 D-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 301 

I I I : : I : I I I I I I I : : I I I I I I I : I I 

Db 289 D FQT P AMCAFN DRVYAT YQ VT RGLAS LN S C VD P I LYFLAGDT FRRRL 335 

RESULT 3 
S33733 

G protein-coupled receptor - chicken 
C; Species: Gallus gallus (chicken) 

C;Date: 06-Jan-1995 #sequence_revision 06-Jan-1995 #text_change 24-Sep-1999
C;Accession: S33733 

R;Webb, T.E.; Simon, J.; Krishek, B.J.; Bateson, A.N.; Smart, T.G.; King, B.F.; 
Burnstock, G. ; Barnard, E.A. 
FEBS Lett. 324, 219-225, 1993 

A; Title: Cloning and functional expression of a brain G-protein-coupled ATP 
receptor. 

A; Reference number: S33733; MUID: 93285340; PMID: 8508924 
A;Accession: S33733 
A; Status: preliminary 
A;Molecule type: mRNA 
A; Residues: 1-362 <WEB> 

A;Cross-references : EMBL:X73268; NID:g395084; PIDN : CAA51716 . 1 ; PID:g395085 
C; Superf amily : ATP receptor P2u 

C;Keywords: G protein-coupled receptor; transmembrane protein 

Query Match 28.6%; Score 472.5; DB 2; Length 362; 

Best Local Similarity 35.9%; Pred. No. 5.2e-32; 

Matches 110; Conservative 58; Mismatches 121; Indels 17; Gaps 8; 

Qy 10 WLA TEAILNK YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

II I : II III I : I I I I I I : : : : I I : I : : I I : I I 

Db 22 WAAGNATTKCSLTKTGFQFYYLPTVYILVFITGFLGNSVAIWMFVFHMRPWSGISVYMFN 81 . 

Qy 61 LSISDFAFLCTLPILIKSYANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLL 119 

I : : : I I : : I II I I II : I I I : I I : : I II I I I I I I I i I : I I 

Db 82 LALADFLYVLTLPALI FYYFNKTDWI FGDVMCKLQRFI FHVNLYGS I LFLTCI SVHRYTG 141 

Qy 120 MKYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPE 178 

: : I : II : I : : I I I I I I : | : | : : I : : I I : 

Db 142 WHPLKSLGRLKKKNAVYVSSLVWALWAVIAPILFYSGTGVRRNKTITCYDTTADEYLR 2 01 

Qy 179 HNLIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPL-DKPQRLVVTiAVVIF 237 

: I I : I I : III I : I : I I : : II I I I :: : : I 
Db 202 SYFVYSMCTTVFMFCIPFIVILGCYGLIVKALIYKDLDNS PLRRKSIYLVIIVLTVF 258 

Qy 238 SILFTPYHIMRNLRIASRLD-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGD 295 

: : : I : I : I : I : : I I I III : : I : II I I I I I : : I I I M I I 

Db 259 AVSYLPFHVMKTLNLRARLDFQTPQMCAFNDKVYATYQVTRGLASLNSCVDPILYFLAGD 318 

Qy 296 HYREML 301 

: I I 

Db 319 T FRRRL 324 



RESULT 4 
A47556 



ATP receptor P2u - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 13-Jan-1995 #sequence_revision 13-Jan-1995 #text_change 24-Sep-1999 
C;Accession: A47556 

R;Lustig, K.D.; Shiau, A.K.; Brake, A. J.; Julius, D. 
Proc. Natl. Acad. Sci. U.S.A. 90, 5113-5117, 1993 

A;Title: Expression cloning of an ATP receptor from mouse neuroblastoma cells. 

A; Reference number: A47556; MUID : 932817 07 ; PMID: 7685114 

A;Accession: A47556 

A; Status : preliminary 

A;Molecule type: mRNA 

A; Residues: 1-373 <LUS> 

A; Cross-references : GB:L14751; NID:g309457; PIDN : AAA39871 . 1 ; PID:g309458 
C; Super family : ATP receptor P2u 
C; Keywords: transmembrane protein 

Query Match 22.6%; Score 373.5; DB 2; Length 373; 

Best Local Similarity 33.5%; Pred. No. 9.9e-24; 

Matches 94; Conservative 50; Mismatches 130; Indels 7; Gaps 5; 

Qy 19 KY- YLSAFYAI EFI FGLLGNVTWFGYLFCMKNWNS SNVYLFNLS I S DFAFLCTLPI LI K 77 

II I I : : I I II : : : I : I I I : I I : I : I : : I I : : I I : I : 
Db 32 KYVLLPVSYGWCVLGLCLNWALYIFLCRLKTWNASTTYMFHLAVSDSLYAASLPLLVY 91 



Qy 78 SYA-NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAI 136 

I I I : III I ::: I I I I I I I I II I I : I I : I : : I 

Db 92 YYARGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCISVHRCLGVLRPLHSLRWGRARYAR 151 



Qy 137 LISIAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPL 196 

: : I I M | : | | : : : | | : : I : I I : I I : I 

Db 152 RVAAVW^LVLACQAPVLYFVTTSVRGTRITCHDTSARELFSHFVAYSSVMLGLLFAVPF 211 



Qy 197 SVMCFFYYKMWFLKRRSQQQATALPLDKPQ — RLWLAWI FS I LFTP YHIMRNLRI AS 254 

II: II I : : II I : I : I : : I : : I I : I : I I *: 

Db 212 SVILVCYVLMARRLLKPAYGTTGGLPRAKRKSVRTIALVLAVFALCFLPFHVTRTLYYSF 271 



Qy 2 55 RLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMG 294 

I I II II I : I I I I I I I : : | : | M I 

Db 272 R — SLDLSCHTLNAINMAYKITRPLASANSCLDPVLYFLAG 310 



RESULT 5 
JC4304 

orphan G protein-coupled receptor - human 
N; Alternate names: V28 protein 
C; Species: Homo sapiens (man) 

C;Date: 16-Nov-1995 #sequence_revision 08-Feb-1996 #text_change 19-May-2000 
C;Accession: JC4304 

R;Raport, C.J.; Schweickart, V.L.; Eddy Jr., R.L.; Shows, T.B.; Gray, P.W. 
Gene 163, 295-299, 1995 

A;Title: The orphan G-protein-coupled receptor-encoding gene V28 is closely 
related to genes for chemokine receptors and is expressed in lymphoid and 
neural tissues.

A; Reference number: JC4304; MUID : 96011651 ; PMID: 7590284 
A; Accession : JC4304 
A; Molecule type: mRNA 
A; Residues: 1-355 <RAP> 



A; Cross-references : GB:U20350; NID:g665580; PIDN : AAA91783 . 1 ; PID:g66558l 
A; Experimental source: peripheral blood mononuclear cell 
C; Comment: This protein is a cell-surface receptor which recognizes 
extracellular signals and transduces those signals into an intracellular 
response . 

C;Comment: This protein is a key regulator of many immune and homeostatic
responses, and interacts between the nervous and immune systems. 
C; Genetics : 
A; Gene: v28 

A; Map position: 3pter-p21 

C; Super family : vertebrate rhodopsin 

C; Keywords: G protein-coupled receptor; lymphokine; transmembrane protein 

F;35-57/Domain: transmembrane #status predicted <TM1>
F;66-88/Domain: transmembrane #status predicted <TM2>
F;104-125/Domain: transmembrane #status predicted <TM3>
F;146-165/Domain: transmembrane #status predicted <TM4>
F;197-217/Domain: transmembrane #status predicted <TM5>
F;230-254/Domain: transmembrane #status predicted <TM6>
F;275-296/Domain: transmembrane #status predicted <TM7>

Query Match 21.4%; Score 353; DB 2; Length 355; 

Best Local Similarity 28.1%; Pred. No. 4.8e-22; 

Matches 84; Conservative 66; Mismatches 133; Indels 16; Gaps 6; 

Qy 16 ILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPIL 75

: : I I I I : : I I I : I I : I I I I : : : : I I II : : I I hill 

Db 28 VFGTVFLSIFYSVIFAIGLVGNLLVVFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 87



Qy 76 IKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFA 135

I : I I : : : I : I I I : I I I : I I I I : : : 

Db 88 THYLINEKGLH-NAMCKFTTAFFFIGFFGSIFFITVISIDRYLAIVLAANSMNNRTVQHG 146 



Qy 136 ILISLAVWALVTLEVLPMLTFINSVPKEEGSNCI-DYASSGNPEHNLIYSLCLTLLGFLI 194 

: I I I I I I I I I | : : : | : | | : : : : MM: 

Db 147 VTISLGVWAAAILVAAPQFMF----TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202



Qy 195 PLSVMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMRNLRIA 253

II :| : |:::: :: ::| h : I : : I I : : I : : I I I : : I I 

Db 203 PLLIMSYCYFRIIQTLFSCKNHKKAKAI------KLILLVVIVFFLFWTPYNVMIFLETL 256



Qy 254 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312 

I : I : I : : : : I : I I : : I I : I I : : I I : III 
Db 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKFRRYL---YHLYGKCL 312



RESULT 6 
S68679 

G protein-coupled receptor - human 
C; Species: Homo sapiens (man) 

C;Date: 15-Feb-1997 #sequence_revision 13-Mar-1997 #text_change 20-Jun-2000 
C;Accession: S68679

R;Stam, N.J.; Klomp, J.; van de Heuvel, M. ; Olijve, W. 
FEBS Lett. 384, 260-264, 1996 

A; Title: Molecular cloning and characterization of a novel orphan receptor 
(P(2P)) expressed in human pancreas that shows high structural homology to the 
P (2U) purinoceptor . 

A; Reference number: S68679; MUID : 96197801 ; PMID: 8617367 



A;Accession: S68679
A; Status: preliminary 
A;Molecule type: DNA 
A; Residues: 1-365 <STA> 

A;Cross-references: EMBL:X96597; NID : gl2 96631 ; PIDN : CAA65415 . 1 ; PID:gl296632 

C; Superf amily: ATP receptor P2u 

C; Keywords: G protein-coupled receptor 

Query Match 21.3%; Score 352; DB 2; Length 365; 

Best Local Similarity 31.1%; Pred. No. 6e-22; 

Matches 90; Conservative 59; Mismatches 122; Indels 18; Gaps 6; 

Qy 22 LSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY-A 80 

t I I : I : I I I : : : : | : : | : : : I : I : I : : I I : : : I I I I II 
Db 38 LPVSYAWFVXGLGLNAPTLWLFIFRLRPWDATATYMFHLALSDTLYVLSLPTLIYYYAA 97 

Qy 81 NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILISL 140 

: : : I : I I : : : I I I I : I I I I I I : I I I : : I I : I I : I 

Db 98 HNHWPFGTEICKFVRFLFYWNLYCSVLFLTCISVHRYLGICHPLRALRWGRPRLAGLLCL 157 

Qy 141 AVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLI YSLCLTLLGFLIPLSVMC 200 

I I I : I : : I I I : : I I I : I : : I : I I : I I 

Db 158 AWLWAGCLVPNLFFWTSNKGTTVLCHDTTRPEEFDHYVHFSSAVMGLLFGVPCLVTL 217 

Qy 201 FFYYKMWFLKRR SQQQATALPLDKPQRL WLAWI FS I LFT P YHIMRNLRI A 253 

I I II I I : : I : I : : : : I : : I I : I I I : 
Db 218 VCYGLMA RRLYQPLPGSAQSSSRL RS LRT I AWLT VFAVC FVP FH I T RT I Y YL 270 

Qy 254 SRLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 301 

: I I I : : I : I I I I I I I : : I : I I I I I I I 

Db 271 ARL— LEADCRVLNIVNWYKVTRPLASANSCLDPVLYLLTGDKYRRQL 317 



RESULT 7 
A57160 

chemokine (C-C) receptor 4 - human 
N;Alternate names: C-C CKR-4 
C;Species: Homo sapiens (man)
C;Date: 10-Nov-1995 #sequence_revision 10-Nov-1995 #text_change 21-Jul-2000
C;Accession: A57160
R;Power, C.A.; Meyer, A.; Nemeth, K.; Bacon, K.B.; Hoogewerf, A.J.; Proudfoot,
A.E.I.; Wells, T.N.C.
J. Biol. Chem. 270, 19495-19500, 1995
A;Title: Molecular cloning and functional expression of a novel CC chemokine
receptor cDNA from a human basophilic cell line.
A;Reference number: A57160; MUID:95370289; PMID:7642634
A;Accession: A57160
A;Status: preliminary; not compared with conceptual translation
A;Molecule type: mRNA
A;Residues: 1-360 <POW>
A;Cross-references: GB:X85740; NID:g1370103; PIDN:CAA59743.1; PID:g971452
A;Note: source clone K5-5
C;Genetics:
A;Gene: GDB:CMKBR4
A;Cross-references: GDB:677463
A;Map position: 3p21-3p21
C;Superfamily: vertebrate rhodopsin
C;Keywords: G protein-coupled receptor; glycoprotein; phosphoprotein;
transmembrane protein
F;40-65/Domain: transmembrane #status predicted <TM1>
F;76-97/Domain: transmembrane #status predicted <TM2>
F;112-133/Domain: transmembrane #status predicted <TM3>
F;151-175/Domain: transmembrane #status predicted <TM4>
F;208-226/Domain: transmembrane #status predicted <TM5>
F;243-264/Domain: transmembrane #status predicted <TM6>
F;291-308/Domain: transmembrane #status predicted <TM7>
F;29-276,110-187/Disulfide bonds: #status predicted
F;72,350/Binding site: phosphate (Ser) (covalent) (by casein kinase II) #status
predicted
F;145/Binding site: phosphate (Ser) (covalent) (by protein kinase C) #status
predicted
F;183,194/Binding site: carbohydrate (Asn) (covalent) #status predicted
F;321/Binding site: phosphate (Thr) (covalent) (by protein kinase C) #status
predicted

Query Match 20.4%; Score 336.5; DB 2; Length 360; 

Best Local Similarity 29.4%; Pred. No. 1.2e-20; 

Matches 93; Conservative 60; Mismatches 132; Indels 31; Gaps 9; 

Qy 14 EAILNKYYL SAF YAIEFIFGLLGNVTWFGYLFCMKNWN 52 

I : I : I It II I :: I : I I I I I I I I III 

Db 13 ESIYSNYYLYESIPKPCTKEGIKAFGELFLPPLYSLVFVFGLLGNSVWL-VLFKYKRLR 71 

Qy 53 S-SNVYLFNLSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTF 111 

I : : I I I I I : I I I : : I I I I I : : I I : : I : I I : 

Db 72 SMTDWLLNLAISDLLFVFSLPFW-GYYAADQWFGLGLCKMISWMYLVGFYSGIFFVML 130 

Qy 112 ISMDRYLLMKYPFREHFLQKKEFAILISIAWALVTLEVLPMLTFINSVPKEEGSNCIDY 171 

: I : I 1 I I : : : : : : I I I I : : I I I : : | 

Db 131 MSIDRYLAIVHAVFSLRARTLTYGVITSLATWSVAVFASLPGFLFSTCYTERNHTYCKTK 190 

Qy 172 ASSGNPEHNLIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLW 231 

I : : : | | : : | | : | | | : | | | | : : | : | : | : : : 

Db 191 YSLNSTTWKVLSSLEINILGLVIPLGIM-LFCYSMII RTLQHCKNEKKNKAVKMIF 245 

Qy 232 LAWIFSILFTPYHIMRNLRIASRLDSWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFY 290 

I I : I : I I I : I : I I : III::: I | | | : : : I I I I 

Db 246 AVWLFLGFWTPYNIVLFLETLVELEVL-QDCTFERYLDYAIQATETLAFVHCCLNPIIY 304

Qy 291 FLMGDHYREMLISKFR 306

I : I : : I : : : I :
Db 305 FFLGEKFRKYILQLFK 320



RESULT 8 
I55450

G protein-coupled P2 receptor - rat
C;Species: Rattus norvegicus (Norway rat)
C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 19-May-2000
C;Accession: I55450
R;Chang, K.; Hanaoka, K.; Kumada, M.; Takuwa, Y.
J. Biol. Chem. 270, 26152-26158, 1995
A;Title: Molecular cloning and functional analysis of a novel P2 nucleotide
receptor.
A;Reference number: I55450; MUID:96064682; PMID:7592819
A;Accession: I55450
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-328 <RES>
A;Cross-references: GB:D63665; NID:g1066007; PIDN:BAA09816.1; PID:g1066008
C;Superfamily: ATP receptor P2u
C;Keywords: G protein-coupled receptor

Query Match 20.4%; Score 336; DB 2; Length 328; 

Best Local Similarity 29.9%; Pred. No. 1.2e-20; 

Matches 85; Conservative 52; Mismatches 137; Indels 10; Gaps 4; 

Qy 22 AFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSYA- 80
Db 29
Qy 81
Db 89
Qy 140
Db 149
Qy 200
Db 209
Qy 255
Db 266



RESULT 9 
A54946 

P-2U nucleotide receptor - human 
C;Species: Homo sapiens (man)
C;Date: 11-Nov-1994 #sequence_revision 11-Nov-1994 #text_change 17-Mar-1999
C;Accession: A54946
R;Parr, C.E.; Sullivan, D.M.; Paradiso, A.M.; Lazarowski, E.R.; Burch, L.H.;
Olsen, J.C.; Erb, L.; Weisman, G.A.; Boucher, R.C.; Turner, J.T.
Proc. Natl. Acad. Sci. U.S.A. 91, 3275-3279, 1994
A;Title: Cloning and expression of a human P-2U nucleotide receptor, a target
for cystic fibrosis pharmacotherapy.
A;Reference number: A54946; MUID:94211846; PMID:8159738
A;Accession: A54946
A;Status: preliminary
A;Molecule type: mRNA; protein
A;Residues: 1-375 <PAR>
A;Cross-references: GB:U07225
A;Note: parts of this sequence were confirmed by protein sequencing
C;Genetics:
A;Gene: GDB:P2RY2; HP2U; P2U
A;Cross-references: GDB:362713; OMIM:600041
A;Map position: 11q13.5-11q14.1
C;Superfamily: ATP receptor P2u
C;Keywords: G protein-coupled receptor; transmembrane protein



Query Match 20.3%; Score 335.5; DB 2; Length 375; 

Best Local Similarity 33.1%; Pred. No. 1.5e-20; 

Matches 93; Conservative 52; Mismatches 127; Indels 9; Gaps 7; 

Qy 19 KY-YLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIK 77 

M I I : : I I I : : : I : I I I : I I : I : I : : I I : : I I : I : 
Db 32 K YVL L P VS Y G WC VL GL C LN AVG L Y I FL C RL KT WN AS T T YMFH LAVS D AL YAAS L P L L VY 91 

Qy 78 SYA-NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAI 136

M I : Ml I : : : I I II I I I M I I I : I I : II : : | 

Db 92 YYARGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCISVHRCLGVLRPLRSLRWGRARYAR 151 

Qy 137 LISLAWALWLEVXPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPL 196 

: : I I I I I | : | | : : : I I :: : I I : I I : I 

Db 152 RVAGA VWVL VLAC QAP VL Y FVT T S ARG P-LTCHDT SAP E L F S R FVAY S S VML GL L FAVP F 210 

Qy 197 SVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVV — IFSILFTPYHIMRNLRIAS 254 

: I : I I I : : : II I : : : I I I : I : : I I : I : I I : 
Db 211 AVILVCYVlJyiARRLLKPAYGTSGGLPRAKRKSVRTIAWLAVFALCFLPFHVTRTLYYSF 270 

Qy 255 RLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMG 294 

I I II II I : II I I | | : : | : M I I 

Db 271 R — SLDLSCHTLNAINMAYKVTR-LASANSCLDPVLYFLAG 308 



RESULT 10 
I50241

G protein-coupled receptor 6H1 - chicken
N;Alternate names: purinoceptor 6H1
C;Species: Gallus gallus (chicken)
C;Date: 13-Sep-1996 #sequence_revision 13-Sep-1996 #text_change 02-Jun-2000
C;Accession: I50241; JC4618
R;Kaplan, M.H.; Smith, D.I.; Sundick, R.S.
J. Immunol. 151, 628-636, 1993
A;Title: Identification of a G protein coupled receptor induced in activated T
cells.
A;Reference number: I50241; MUID:93329058; PMID:8393036
A;Accession: I50241
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-308 <KAP>
A;Cross-references: GB:L06109; NID:g304383; PIDN:AAB06587.1; PID:g304384
R;Webb, T.E.; Kaplan, M.G.; Barnard, E.A.
Biochem. Biophys. Res. Commun. 219, 105-110, 1996
A;Title: Identification of 6H1 as a P2Y purinoceptor: P2Y5.
A;Reference number: JC4618; MUID:96190677; PMID:8619790
A;Accession: JC4618
A;Molecule type: mRNA
A;Residues: 1-308 <WEB>
A;Cross-references: GB:L06109; NID:g304383; PIDN:AAB06587.1; PID:g304384
A;Experimental source: T-cells
C;Comment: This receptor plays a role in T-cell activation.
C;Genetics:
A;Gene: p2Y5
C;Superfamily: ATP receptor P2u
C;Keywords: G protein-coupled receptor; transmembrane protein
F;15-40/Domain: transmembrane #status predicted <TM1>
F;51-74/Domain: transmembrane #status predicted <TM2>
F;89-109/Domain: transmembrane #status predicted <TM3>
F;133-153/Domain: transmembrane #status predicted <TM4>
F;177-201/Domain: transmembrane #status predicted <TM5>
F;227-248/Domain: transmembrane #status predicted <TM6>
F;269-292/Domain: transmembrane #status predicted <TM7>

Query Match 20.2%; Score 333; DB 2; Length 308; 

Best Local Similarity 28.9%; Pred. No. 1.9e-20; 

Matches 88; Conservative 52; Mismatches 123; Indels 42; Gaps 6;

Qy 19 KYYL-SAFYAI EFI FGLLGNVTWFGYLFCMKNWNS SNVYLFNLS I SDFAFLCTLPI LI K 77 

Ml : : : | : | | : | : : : | : | | : I : I I : I I I I : i I I I 

Db 13 KYTLYGCVFSMVFVLGLIANCVAIYIFTFTLKVRNETTTYMLNLAISDLLFVFTLPFRIY 72 

Qy 78 S YANDKGTYGDVLCI SNRYVLHTNLYTSILFLTFI SMDRYLLMKYPFREHFLQKKEFAI L 137 

= = 11111 : : : I I : I I I I I I I I I : I I : I : : I I I I : I I : 

Db 73 YFWRNWPFGDVLCKI SVTLFYTNMYGS ILFLTCI SVDRFLAIVHPFRSKTLRTKRNARI 132 

Qy 138 ISLAWALWLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLC 18 6 

: : I I I I II : I I : : I | 

Db 133 VCVAVWITVL AGSTPASFFQSTNRQNNTEQRTCFENFPESTWKT 176 

Qy 187 LTLLGFLIP — LSVMCFFYYKMW-FLKRRSQQQATALPLDKPQRLWLAW 235 

: : : I I I I hi I I I : I : I I : : : : : | 

Db 177 YLSRIVIFIEIVGFFIPLILNVTC STMVLRTLNKPLTLSRNKLSKKKVLKMI FVHLV 233 

Qy 236 IFSILFTPYHIMRNLRIASRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGD 2 95 

M I I I 5 I I I :| |::::| :| :| I :|| |: I 

Db 234 IFCFCFVPYNITLILYSIJyiRTQTWINCSVVTAVRTMYPVTLCIAVSNCCFDPIVYYFTSD 293 

Qy 296 HYREM 300 

I : 

Db 294 TNSEL 298 



RESULT 11 
I58186

probable G protein-coupled receptor - rat
C;Species: Rattus norvegicus (Norway rat)
C;Date: 26-Jul-1996 #sequence_revision 26-Jul-1996 #text_change 21-Jul-2000
C;Accession: I58186
R;Harrison, J.K.; Barber, C.M.; Lynch, K.R.
Neurosci. Lett. 169, 85-89, 1994
A;Title: cDNA cloning of a G-protein-coupled receptor expressed in rat spinal
cord and brain related to chemokine receptors.
A;Reference number: I58186; MUID:94323113; PMID:8047298
A;Accession: I58186
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-354 <RES>
A;Cross-references: EMBL:U04808; NID:g2558635; PIDN:AAB87093.1; PID:g439861
C;Superfamily: vertebrate rhodopsin
C;Keywords: G protein-coupled receptor



Query Match 20.2%; Score 332.5; DB 2; Length 354; 

Best Local Similarity 27.9%; Pred. No. 2.5e-20; 

Matches 82; Conservative 66; Mismatches 125; Indels 21; Gaps 6; 

QY 21 YLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSYA 80 

: I I I I : : I I I I : I I : I I I : : : : : I I I I : : I I : , I I 

Db 34 FLSIFYSLVFTFGLVGNLLWLALTNSRKSKSITDIYLLNLALSDLLFVATLPFWTHYLI 93 

Qy 81 NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILISL 140 

: : I : : : I : I I : I I I : I I I I : : : : I | | 

Db 94 SHEGLH-NAMCKLTTAFFFI GFFGGI FFITV1 S I DRYLAI VLAANSMNNRTVQHG VT I SL 152 

Qy 141 AVWALVTLEVLPMLTFINSVPKEEGSNCI-DYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199 

IN I I I | : : | : | | : : : : : | | | : : | | : | 

Db 153 GWAAAI LVAS PQFMF TKRKDNECLGDYPEVLQEIWPVLRNSEVNILGFVLPLLIM 208 

QY 200 CFFYYKMV-VFLKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIASRLDS 258 

I -:::| I: I I : : I I I : I : : I I I : I : | : 

Db 209 SFCYFRIVRTLFSCKNRKKARAI RLILLWWFFLFWTPYNIVIFLETLKFYNF 262 

Qy 259 WPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312 

= 1 ::| :|| : :|| | |: I I I : I : I 

Db 263 FPSCGMKRDLRWALSVTETVAFSHCCLNPFIYAFAGE KFRRYLRHL 308

RESULT 12 
JC4800 

P2Y6 receptor - human 

C;Species: Homo sapiens (man)
C;Date: 15-Oct-1995 #sequence_revision 16-Aug-1996 #text_change 17-Nov-2000
C;Accession: JC4800; G02514
R;Communi, D.; Parmentier, M.; Boeynaems, J.M.
Biochem. Biophys. Res. Commun. 222, 303-308, 1996
A;Title: Cloning, functional expression and tissue distribution of the human
P2Y6 receptor.
A;Reference number: JC4800; MUID:96222498; PMID:8670200
A;Accession: JC4800
A;Molecule type: mRNA
A;Residues: 1-328 <COM>
A;Cross-references: EMBL:X97058
A;Experimental source: placenta
R;Hammet, F.; Southey, M.C.; Somers, G.R.; Hutchins, A.M.; Venter, D.J.
submitted to the EMBL Data Library, March 1996
A;Reference number: H01373
A;Accession: G02514
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 'M',4-328 <HAM>
A;Cross-references: EMBL:U52464; NID:g1407632; PIDN:AAB03572.1; PID:g1407633
C;Genetics:
A;Gene: P2Y6
C;Superfamily: ATP receptor P2u
C;Keywords: glycoprotein; placenta; receptor; transmembrane protein
F;26-52/Domain: transmembrane #status predicted <TMM1>
F;63-86/Domain: transmembrane #status predicted <TMM2>
F;104-122/Domain: transmembrane #status predicted <TMM3>
F;143-167/Domain: transmembrane #status predicted <TMM4>
F;193-216/Domain: transmembrane #status predicted <TMM5>
F;241-264/Domain: transmembrane #status predicted <TMM6>
F;283-305/Domain: transmembrane #status predicted <TMM7>
F;5,173/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 20.1%; Score 332; DB 2; Length 328; 

Best Local Similarity 30.6%; Pred. No. 2.5e-20; 

Matches 90; Conservative 48; Mismatches 136; Indels 20; Gaps 6; 



Qy 22 LSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSYA- 80
1 1 : M 1 : 1 : : : 1 1 1 1 : : : 1 : 1 : 1 1 : 1 1 : | |
Db 29 LPPVYSAVIAAGLPLNICVITQICTSRRALTRTAWTLNLALADLLYACSLPLLIYNYAQ 88

Qy 81 NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFRE-HFLQKKEFAILIS 139
1 : 1 1 1 1 : : : 1 1 : 1 1 1 1 1 1 1 I 1 1 1 : : 1 1 : I |:
Db 89 GDHWPFGDFACRLVRFLFYANLHGSILFLTCISFQRYLGICHPLAPWHKRGGRRAAWLVC 148

Qy 140 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199
: 1 1 1 1 1 : 1 1 1 : : : 1 1 : 1 : 1 : 1 1 : : 1 1 1 : 1 : :
Db 149 VAWIJVVTTQCLPTAIFAATGIQRNRTVCYDLSPPALATHYMPYGMALTVIGFLLPFAAL 208

Qy 200 CFFYYKMWFLKRRSQQQATALPL DKPQRLWLAWI FS I LFT P YHIMRNLRIAS 254
1 : 1 : 1 II: 1 1 : 1 : 1 : 1 1 1 : 1 1 : : I
Db 209 LACYCLLAC RLCRQDGPAE PVAQERRGKAARMAVWAAAFAI S FL P FH I TKTAYLAV 265

Qy 255 RLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPI-FYFLMGDHYREMLISKFRQ 307
1 : 1 : 1 1 1 1 1 1 1 :: 1 1 1 1 1 Ml:
Db 266 RS T P GVPCTVLEAFAAAYKGT RP FAS AN SVLDPI LFYFTQ KKFRR 310


RESULT 13 
A45177 

chemokine (C-C) receptor 1 - human 

N;Alternate names: C-C CKR-1; macrophage inflammatory protein-1-alpha receptor
C;Species: Homo sapiens (man)
C;Date: 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change 13-Aug-1999
C;Accession: A45177; I55671
R;Neote, K.; DiGregorio, D.; Mak, J.Y.; Horuk, R.; Schall, T.J.
Cell 72, 415-425, 1993
A;Title: Molecular cloning, functional expression, and signaling characteristics
of a C-C chemokine receptor.
A;Reference number: A45177; MUID:93161416; PMID:7679328
A;Accession: A45177
A;Status: nucleic acid sequence not shown
A;Molecule type: mRNA
A;Residues: 1-355 <NEO>
A;Cross-references: GB:L10918; NID:g292416; PIDN:AAA36543.1; PID:g292417
A;Experimental source: HL60 cells
A;Note: sequence extracted from NCBI backbone (NCBIP:124876)
R;Gao, J.
J. Exp. Med. 177, 1421-1427, 1993
A;Title: Structure and functional expression of the human macrophage
inflammatory 1 alpha (MIP-1alpha)/RANTES receptor.
A;Reference number: I55671; MUID:93240122; PMID:7683036
A;Accession: I55671
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-355 <RES>
A;Cross-references: GB:L10918; NID:g292416; PIDN:AAA36543.1; PID:g292417
C;Genetics:
A;Gene: GDB:CMKBR1; CMKR-1
A;Cross-references: GDB:138446; OMIM:601159
A;Map position: 3p21-3p21
C;Superfamily: vertebrate rhodopsin
C;Keywords: disulfide bond; G protein-coupled receptor; glycoprotein;
phosphoprotein; transmembrane protein
F;36-60/Domain: transmembrane #status predicted <TM1>
F;71-91/Domain: transmembrane #status predicted <TM2>
F;108-129/Domain: transmembrane #status predicted <TM3>
F;147-171/Domain: transmembrane #status predicted <TM4>
F;205-223/Domain: transmembrane #status predicted <TM5>
F;240-264/Domain: transmembrane #status predicted <TM6>
F;288-305/Domain: transmembrane #status predicted <TM7>
F;5/Binding site: carbohydrate (Asn) (covalent) #status predicted
F;24-273,106-183/Disulfide bonds: #status predicted
F;345/Binding site: phosphate (Ser) (covalent) (by casein kinase II) #status
predicted

Query Match 19.9%; Score 328; DB 2; Length 355; 

Best Local Similarity 29.0%; Pred. No. 5.9e-20; 

Matches 85; Conservative 63; Mismatches 127; Indels 18; Gaps 8; 

Qy 22 LSAFYAIEFIFGLLGNWWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSYAN 81 

I I : : I : I I : I I : | | : : I : : : M M : I I I Mill I 
Db 37 LPPLYSLVFVIGLVGNILWLVLVQYKRLKNMTSIYLLNLAISDLLFLFTLPFWIDYKLK 96 

QY 82 DKGTYGDVLC — ISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILIS 139 

I Ml M :l I :| lh I |: :::|||| : : : | :: | 

Db 97 DDWVFGDAMCKILSGFY — YTGLYSEIFFIILLTIDRYLAIVHAVFALRARTVTFGVITS 154 

Qy 140 LAVWALVTLEVLPMLTFINSVPKEEGSNC-IDYASSGNPEHNLIYSLCLTLLGFLIPLSV 198 

— Ml I =111 : : I : : | | :| | | | ::|| | 

Db 155 IIIWALAI LASMPGLYFSKTQWEFTHHTCSLHFPHESLREWKLFQALKLNLFGLVLPLLV 214 

Qy 199 MCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIASRLD- 257 

I I :: I II :: : I I I : : : : || : :| || : : : |: | 

Db 215 MIICYTGIIKILLRRPNEKKS KAVRLI FVIMII FFLFWTPYNL — TILI SVFQDF 267 

Qy 258 SWPQGCTQ-KAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYF 309 

: II:: : I : I : : : II : I : I : : I : I III 
Db 268 LFTHECEQSRHLDLAVQVTEVIAYTHCCVNPVIYAFVGERFRKYL RQLF 316



RESULT 14 
JC4587 

chemokine (C-C) receptor 4 - mouse 
C;Species: Mus musculus (house mouse)
C;Date: 08-Mar-1996 #sequence_revision 19-Apr-1996 #text_change 20-Jun-2000
C;Accession: JC4587
R;Hoogewerf, A.J.; Black, D.; Proudfoot, A.E.I.; Wells, T.N.C.; Power, C.A.
Biochem. Biophys. Res. Commun. 218, 337-343, 1996
A;Title: Molecular cloning of murine CC CKR-4 and high affinity binding of
chemokines to murine and human CC CKR-4.
A;Reference number: JC4587; MUID:96136324; PMID:8573157
A;Accession: JC4587
A;Molecule type: mRNA
A;Residues: 1-360 <HOO>
A;Cross-references: EMBL:X90862; NID:g1167851; PIDN:CAA62372.1; PID:g1167852
A;Experimental source: thymus
C;Genetics:
A;Gene: cc ckr-4
C;Superfamily: vertebrate rhodopsin
C;Keywords: glycoprotein; phosphoprotein; receptor; thymus
F;2,183,194/Binding site: carbohydrate (Asn) (covalent) #status predicted
F;72,202,350/Binding site: phosphate (Ser) (covalent) (by casein kinase II)
#status predicted
F;145/Binding site: phosphate (Ser) (covalent) (by protein kinase C) #status
predicted
F;321/Binding site: phosphate (Thr) (covalent) (by protein kinase C) #status
predicted



Query Match 19.8%; Score 326.5; DB 2; Length 360; 

Best Local Similarity 28.4%; Pred. No. 8e-20; 

Matches 93; Conservative 57; Mismatches 136; Indels 41; Gaps 10; 



Qy 14 EAILNKYY LSAFYAI EFI FGLLGNVTWFGYLFCMKNWN 52
1 : 1 1 1 1 1 : : 1 : 1 1 1 1 1 1 III
Db 13 ETVYNSYYFYESMPKPCTKEGIKAFGEVFLPPLYSLVFLLGLFGNSVWL-VLFKYKRLK 71

Qy 53 S-SNVYLFNLSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTF 111
1 1 1 1 1 1 : II 1 1 : : 1 1 III: : I I | : : | : | | :
Db 72 SMTDVTLLNLAISDLLFVLSLPFW-GYYAADQWVFGLGLCKIVSWMYLVGFYSGIFFIML 130

Qy 112 ISMDRYLLMKYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDY 171
: 1 : 1 1 1 1 : : : : : : I I I : : 1 1 1 1 : : 1
Db 131 MSIDRYLAIVHAVFSLKARTLTYGVITSLITWSVAVFASLPGLLFSTCYTEHNHTYCKTQ 190

Qy 172 ASSGNPEHNLIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLW 231
1 : : : 1 1 : : 1 1 1 1 1 1 : 1 1 : 1 1 : : I : I : : | : :
Db 191 YSVNSTTWKVLS SLEINVLGLLI PLGIM-LFWYSMI I RTLQHCKNEKKNRAVRMIF 245

Qy 232 LAWIFSILFTPYHIMRNLRIASRLDSWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFY 290
1 1 : 1 : II 1 : : : 1 | : III::: 1 I 1 : : : | | : |
Db 246 GVWLFLGFWTPYNWLFLETLVELEVX-QDCTLERYLDYAIQATETLGFIHCCLNPVIY 304

Qy 291 FLMGDHYREMLI SKFRQYFKSLTS FRT 317
1 : 1 : 1 1 1 : 1 1 III
Db 305 FFLGE KFRKYITQL — FRT 321


RESULT 15 
JN0605 

somatostatin receptor 4 - human 
C;Species: Homo sapiens (man)
C;Date: 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change 20-Jun-2000
C;Accession: JN0605; JN0762; A47457
R;Xu, Y.; Song, J.; Bruno, J.F.; Berelowitz, M.
Biochem. Biophys. Res. Commun. 193, 648-652, 1993
A;Title: Molecular cloning and sequencing of a human somatostatin receptor,
hSSTR4.
A;Reference number: JN0605; MUID:93290656; PMID:8512564
A;Accession: JN0605
A;Molecule type: DNA
A;Residues: 1-388 <XUY>
A;Cross-references: GB:L14856; NID:g292499; PIDN:AAA36623.1; PID:g292500
R;Yamada, Y.; Kagimoto, S.; Kubota, A.; Yasuda, K.; Masuda, K.; Someya, Y.;
Ihara, Y.; Li, Q.; Imura, H.; Seino, S.; Seino, Y.
Biochem. Biophys. Res. Commun. 195, 844-852, 1993
A;Title: Cloning, functional expression and pharmacological characterization of
a fourth (hSSTR4) and a fifth (hSSTR5) human somatostatin receptor subtype.
A;Reference number: JN0762; MUID:93384611; PMID:8373420
A;Accession: JN0762
A;Molecule type: DNA
A;Residues: 1-388 <YAM>
A;Cross-references: GB:D16826; NID:g693907; PIDN:BAA04106.1; PID:g693908
R;Rohrer, L.; Raulf, F.; Bruns, C.; Buettner, R.; Hofstaedter, F.; Schule, R.
Proc. Natl. Acad. Sci. U.S.A. 90, 4196-4200, 1993
A;Title: Cloning and characterization of a fourth human somatostatin receptor.
A;Reference number: A47457; MUID:93248256; PMID:8483934
A;Accession: A47457
A;Molecule type: DNA
A;Residues: 1-82,'T',84-364,'K',366-388 <ROH>
A;Cross-references: GB:L07833; NID:g307429; PIDN:AAA60565.1; PID:g307430
A;Note: sequence extracted from NCBI backbone (NCBIN:130856, NCBIP:130858)
C;Comment: This protein mediates the diverse actions of the tetradecapeptide
somatostatin.
C;Genetics:
A;Gene: GDB:SSTR4
A;Cross-references: GDB:202662; OMIM:182454
A;Map position: 20p11.2-20p11.2
A;Introns: #status absent
C;Superfamily: vertebrate rhodopsin
C;Keywords: G protein-coupled receptor; glycoprotein; hormone receptor;
lipoprotein; phosphoprotein; thiolester bond; transmembrane protein
F;47-73/Domain: transmembrane #status predicted <TM1>
F;84-109/Domain: transmembrane #status predicted <TM2>
F;121-142/Domain: transmembrane #status predicted <TM3>
F;162-184/Domain: transmembrane #status predicted <TM4>
F;208-238/Domain: transmembrane #status predicted <TM5>
F;257-284/Domain: transmembrane #status predicted <TM6>
F;291-314/Domain: transmembrane #status predicted <TM7>
F;24/Binding site: carbohydrate (Asn) (covalent) #status predicted
F;119-198/Disulfide bonds: #status predicted
F;161,253/Binding site: phosphate (Ser) (covalent) (by cAMP-dependent kinase)
#status predicted
F;327/Binding site: palmitate (Cys) (covalent) #status predicted

Query Match 19.7%; Score 325.5; DB 2; Length 388; 

Best Local Similarity 28.4%; Pred. No. 1.1e-19;

Matches 82; Conservative 66; Mismatches 110; Indels 31; Gaps 5; 

Qy 22 LSAFYAI EFI FGLLGNVTWFGYLFCMKNWNS SNVYLFNLS I S DFAFLCTLPI LI KS YAN 81 

= M: : I I : I I hi i I : : I : II I I :: : I I : : : I : II 
Db 50 I QCI YALVCLVGLVGNALVI FVILRYAKMKTATNI YLLNIAVADELFMLS VPFVAS SAAL 109 

QY 82 DKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILISLA 141 

Mill: I I : : I I : I I : I : I I I : : : I I : : | I I : I 

Db 110 RHWPFGSVLCRAVLSVDGLNMFTSVFCLTVLSVDRYVAWHPLRAATYRRPSVAKLINLG 169



Qy 142 VWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLI YSLCLTLLGFLIPLSVMCF 201 

II I II: I :: I I :| : :: : |||||:|: : 

Db 170 VWLASLLVTLPIAIFADTRPT^RGGQAVACNLQWPHPAWSAVBWYTFLLGFLLPVLAIGL 229 

QY 2 02 FYYKMV VFL KRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNL 250 

I :| II :| | |::: | I I I : : I I : I : : I : : : : : I 

Db 230 CYLLIVGKMRAVALRAGWQQRRRSEKKIT RLVLMWWFVLCWMPFYWQLL 281 

Qy 251 R-IASRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYR 298 

: : I I : : : : : I : : I I I I I I : I : : I 

Db 282 NLWTSLDA TVNHVSLILSYANSCANPILYGFLSDNFR 319 



Search completed: August 23, 2004, 17:03:36 
Job time : 42 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model

Run on:          August 23, 2004, 17:03:00 ; Search time 125 Seconds
                 (without alignments)
                 796.951 Million cell updates/sec

Title:           US-09-891-138A-2
Perfect score:   1650
Sequence:        1 MAQNLSCENWLATEAILNKY ... REMLISKFRQYFKSLTSFRT 317

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        1295152 seqs, 314255058 residues

Total number of hits satisfying chosen parameters: 1295152

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
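
The throughput figure in the run header appears to follow from the other header values: query length (317) times database residues searched (314255058), divided by the search time (125 seconds). The short Python sketch below reproduces it. The function name is illustrative only, and the reading of a "cell update" as one query-residue by database-residue comparison is an assumption, not something the report states.

```python
# Sketch: reproduce the reported throughput from figures in the run header.
# Assumption: one "cell update" is one query-residue x database-residue
# comparison in the dynamic-programming matrix.

def cell_updates_per_sec(query_len: int, db_residues: int, seconds: float) -> float:
    """Return dynamic-programming cell updates per second."""
    return query_len * db_residues / seconds

# Values copied from the run header: 317-residue query, 314255058 residues
# searched, 125 seconds search time.
rate = cell_updates_per_sec(317, 314_255_058, 125)
print(f"{rate / 1e6:.3f} Million cell updates/sec")
```

Under that assumption the result rounds to the 796.951 Million cell updates/sec printed in the header.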



Database :       Published_Applications_AA
1: /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep:*
2: /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep:*
3: /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep:*
4: /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep:*
5: /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep:*
6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep:*
7: /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep:*
8: /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep:*
9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep:*
10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep:*
11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep:*
12: /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:*
13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep:*
14: /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep:*
15: /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep:*
16: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep:*
17: /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep:*
18: /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result Query 

No. Score Match Length DB ID 



ALIGNMENTS 



RESULT 1 

US-09-891-138A-2 

; Sequence 2, Application US/09891138A 

; Publication No. US20030083245A1 

; GENERAL INFORMATION: 

; APPLICANT: Lin, Daniel Chi-Hong 



; APPLICANT: Zhao, Jiagang
; APPLICANT: Chen, Jin-Long
; APPLICANT: Cutler, Gene
; APPLICANT: Tularik Inc.
; TITLE OF INVENTION: Novel Receptors
; FILE REFERENCE: 018781-006210US
; CURRENT APPLICATION NUMBER: US/09/891,138A
; CURRENT FILING DATE: 2001-06-25
; PRIOR APPLICATION NUMBER: US 60/213,461
; PRIOR FILING DATE: 2000-06-23
; NUMBER OF SEQ ID NOS: 26
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 2
; LENGTH: 317
; TYPE: PRT
; ORGANISM: Mus musculus
; FEATURE:
; OTHER INFORMATION: mouse TGR18 G-protein coupled receptor (GPCR)
US-09-891-138A-2 

Query Match 100.0%; Score 1650; DB 10; Length 317; 

Best Local Similarity 100.0%; Pred. No. 4.6e-142; 

Matches 317; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MAQN L S C ENW LAT EAI LN K Y YL S AF YAI E F I FGL L GNVT WFG YL FCMKNWN S S N VY L FN 60 

1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I 
Db 1 MAQNL S CENW LAT EAI LNKYYLSAF YAI EFIFGLLGNWVVFGYL FCMKNWN SSNVYL FN 60 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

I I I I I I I I t I I I I I I I I I I M I I I I I I I I I M I M I I I I I I I I I I I I I I I I M I I I I I II 

Db 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120

Qy 121 KYPFREHFLQKKEFAILISLAWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

II I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I II I I I I I I I I I I M II I I 

Db 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240 

I I I II I I II I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240

Qy 241 FTPYHIMRNLRIASRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREM 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 241 FT P YHIMRNLRI ASRLDSWPQGCTQKAI KS I YTLTRPLAFLNSAINP I FYFLMGDHYREM 300 

Qy 301 LISKFRQYFKSLTSFRT 317 

I I I I I I I M I I I I I I I I 
Db 301 LISKFRQYFKSLTSFRT 317 



RESULT 2 

US-10-225-567A-567 

; Sequence 567, Application US/10225567A 

; Publication No. US20030113798A1 

; GENERAL INFORMATION: 

; APPLICANT: Lifespan Biosciences 

; APPLICANT: Brown, Joseph P. 

; APPLICANT: Burmer, Glenna C. 



; APPLICANT: Roush, Christine L.
; TITLE OF INVENTION: ANTIGENIC PEPTIDES AND ANTIBODIES FOR G PROTEIN-COUPLED
; RECEPTORS (GPCRS)
; FILE REFERENCE: 1920-4-4
; CURRENT APPLICATION NUMBER: US/10/225,567A
; CURRENT FILING DATE: 2001-12-19
; PRIOR APPLICATION NUMBER: 60/257,144
; PRIOR FILING DATE: 2000-12-19
; NUMBER OF SEQ ID NOS: 2292
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 567
; LENGTH: 330
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-225-567A-567 

Query Match 75.1%; Score 1238.5; DB 14; Length 330; 

Best Local Similarity 72.2%; Pred. No. 1.5e-104; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAI LNKYYLSAFYAIEFI FGLLGNWVVFGYLFCMKNWNSSNVYLFN 60 

II I : I : II I I II I I I I I I II III: I : I II I I : I I : I : I I M I II : I I I I 
Db 1 MAWNATCKNWIAAEAALEKYYLSIFYGIEFWGVLGNTIVVYGYIFSLKNWNSSNIYLFN 60 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

I I : I I II I I I I I : I I : I I I I I I I I I I I I I II I I I I 1 I I I I I I I I II i : I I I I : : 

Db 61 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 120 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

I I II M I I I I I I I I I I I I I I : I I I II I : I I : I II I : I : I I : 1 I M : I : I 
Db 121 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 18 0 

Qy 181 LIYSLCLTLLGFLIPLSViyiCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240 

I I II : I I I I I II I I II I I I I I I I I : : I II : I : : I I I I II I : I I I I : : I I I I I I : I 
Db 181 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 24 0 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

I I I M : I I I : I I I I I I II I III I I I : I I I I I I I I I I I I : I I II : I I I : I : 
Db 241 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 300

Qy 300 MLISKFRQYFKSLTSF 315 

II::: I I II I I I I 
Db 301 MLMNQLRHNFKSLTSF 316



RESULT 3 

US-09-764-886-45 

; Sequence 45, Application US/09764886 

; Publication No. US20030139327A9 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al. 

; TITLE OF INVENTION: Nucleic Acids, Proteins, and Antibodies 
; FILE REFERENCE: PTZ02 

; CURRENT APPLICATION NUMBER: US/09/764,886
; CURRENT FILING DATE: 2001-01-17 

; Prior application data removed - consult PALM or file wrapper 
; NUMBER OF SEQ ID NOS: 88



; SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 45
; LENGTH: 334
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-764-886-45 

Query Match 75.1%; Score 1238.5; DB 10; Length 334; 

Best Local Similarity 72.2%; Pred. No. 1.5e-104; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

II I : I : I I I I II I I I I I I II III: I : I I I I I : I I : I : I I I I I I I : I I I I 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

I I : I I II I I II I : I I : I I I I I I I I I I I I M 1 I I I I I I I I I I II I I I I : I I I I : : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

I I I I I I I I I I I I I II I I I I I : I I I I I I : I I : I I I I : I : I I : I I I I : I =1 
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240

I I I I : I I I I I I I I I I I I I I I I I I I : : II I : I : : I I I I I I I : I I I I :: I I I I I I : I 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299

I M I I : I M : I I I II I II I Ml I I I : I I I I I M II I I I : I M I : I I I : I : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304

Qy 300 MLISKFRQYFKSLTSF 315 

I I : : : I I I I I I I I 
Db 305 MLMNQLRHNFKSLTSF 320



RESULT 4 

US-09-764-886-70 

; Sequence 70, Application US/09764886 

; Publication No. US20030139327A9

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al.

; TITLE OF INVENTION: Nucleic Acids, Proteins, and Antibodies 

; FILE REFERENCE: PTZ02
; CURRENT APPLICATION NUMBER: US/09/764,886
; CURRENT FILING DATE: 2001-01-17 

; Prior application data removed - consult PALM or file wrapper 
; NUMBER OF SEQ ID NOS: 88
; SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 70
; LENGTH: 334
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-764-886-70 

Query Match 75.1%; Score 1238.5; DB 10; Length 334; 

Best Local Similarity 72.2%; Pred. No. 1.5e-104; 



Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1;



Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

II I : I : I I I 1 II I I I I I I II III: hill I I : I I : I : II II I I M I I I I 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

I I : I I I I I I I I I : I I : II I I I I I I I I I I II I I I I I I I I I I I I I I I I I : I I I I : : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180

I II I I I I I I I I I I I I I I I I I : I I I I I I : I I : I II I : I : I I : I I I I : I : I 
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240

I I I I : I I I I M I I I I I I I I I I I I I : : I I I : I : : I I I I I I I : I I I I : : II M I I : I 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299

I I I I I : I I I : I I I I I I II I II I I I I : I I I I I I I I I I I I : I I I I : I I I : I : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304

Qy 300 MLISKFRQYFKSLTSF 315 

I I : : : I I I I I I I I 
Db 305 MLMNQLRHNFKSLTSF 320 



RESULT 5 

US-09-764-886-45 

; Sequence 45, Application US/09764886 

; Publication No. US20020086822A1

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al.
; TITLE OF INVENTION: Nucleic Acids, Proteins, and Antibodies
; FILE REFERENCE: PTZ02
; CURRENT APPLICATION NUMBER: US/09/764,886
; CURRENT FILING DATE: 2001-01-17 

; Prior application data removed - consult PALM or file wrapper
; NUMBER OF SEQ ID NOS: 88
; SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 45
; LENGTH: 334
; TYPE: PRT
; ORGANISM: Homo sapiens 
US-09-764-886-45 

Query Match 75.1%; Score 1238.5; DB 12; Length 334;

Best Local Similarity 72.2%; Pred. No. 1.5e-104; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

II I : I : I I I I II I Mill I I Ml: Mill I M I M I : I II I I I I : II II 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

II : II M II I I M I M I I I I I I I I I I II I II I II I I I I I I I I I I I I M I I II : : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124



Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

I I I I I I I 1111111111111:1 I I I I I : I ) : I II I * I : I I : I I I I : I : I 
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240

I I I I : II I I I I I I I I I I I I I I I I I : : I I I : I : : I I I I I I I : I I I I : : I I M I I : I 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

I I I II : I I I : I I I I I I II I III I I I : I I I I I I I I I I I I : I I I I : I I 1 : I : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304

Qy 300 MLISKFRQYFKSLTSF 315 

I I ::: I I I I I I I I 
Db 305 MLMNQLRHNFKSLTSF 320



RESULT 6 

US-09-764-886-70 

; Sequence 70, Application US/09764886 

; Publication No. US20020086822A1 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al.

; TITLE OF INVENTION: Nucleic Acids, Proteins, and Antibodies 
; FILE REFERENCE: PTZ02 

; CURRENT APPLICATION NUMBER: US/09/764,886
; CURRENT FILING DATE: 2001-01-17 

; Prior application data removed - consult PALM or file wrapper 
; NUMBER OF SEQ ID NOS: 88
; SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 70 
; LENGTH: 334 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-764-886-70 

Query Match 75.1%; Score 1238.5; DB 12; Length 334; 

Best Local Similarity 72.2%; Pred. No. 1.5e-104; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

II I : I : || I I II I Mill II Ml: Mill I I : I I : I : I I II I I I : II I I 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120

II : I I II I I I I M I I : I I I I I II I I I I I I I M II II II I I I I I I I I I : M M :: 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

I II I I I I II I I II I I I I II I : I I I II I : II : I II I : I : I I : I M M I : I 
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240

I II I : I II I I I I I I II I I I II I II : : M I : M : I II I I I M I I I I : : I M M I : I 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244



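Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299

Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304

Qy 300 MLISKFRQYFKSLTSF 315

Db 305 MLMNQLRHNFKSLTSF 320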



RESULT 7 



US-09-875-076-36 

; Sequence 36, Application US/09875076 

; Publication No. US20030017528A1

; GENERAL INFORMATION: 

; APPLICANT: Chen, Ruoping 

; APPLICANT: Dang, Huong T. 

; APPLICANT: Liaw, Chen W. 

; APPLICANT: Lin, I-Lin 

; TITLE OF INVENTION: Human Orphan G Protein Coupled Receptors 
; FILE REFERENCE: AREN0050 

; CURRENT APPLICATION NUMBER: US/09/875,076

; CURRENT FILING DATE: 2001-06-06 

; PRIOR APPLICATION NUMBER: 09/417,044 

; PRIOR FILING DATE: 1999-10-12 

; PRIOR APPLICATION NUMBER: 60/120,416 

; PRIOR FILING DATE: 1999-02-16 

; PRIOR APPLICATION NUMBER: 60/121,851 

; PRIOR FILING DATE: 1999-02-26 

; PRIOR APPLICATION NUMBER: 60/123,946 

; PRIOR FILING DATE: 1999-03-12 

; PRIOR APPLICATION NUMBER: 60/123,949 

; PRIOR FILING DATE: 1999-03-12 

; PRIOR APPLICATION NUMBER: 60/136,436
; PRIOR FILING DATE: 1999-05-28 
; PRIOR APPLICATION NUMBER: 60/136,437 
; PRIOR FILING DATE: 1999-05-28 
; PRIOR APPLICATION NUMBER: 60/136,439 
; PRIOR FILING DATE: 1999-05-28 
; PRIOR APPLICATION NUMBER: 60/136,567 
; PRIOR FILING DATE: 1999-05-28 
; PRIOR APPLICATION NUMBER: 60/137,127 

; PRIOR FILING DATE: 1999-05-28
; PRIOR APPLICATION NUMBER: 60/137,131 
; PRIOR FILING DATE: 1999-05-28 
; PRIOR APPLICATION NUMBER: 60/141,448 
; PRIOR FILING DATE: 1999-06-29 
; PRIOR APPLICATION NUMBER: 60/156,653 

; PRIOR FILING DATE: 1999-09-29
; PRIOR APPLICATION NUMBER: 60/156,633 
; PRIOR FILING DATE: 1999-09-29 
; PRIOR APPLICATION NUMBER: 60/156,555 
; PRIOR FILING DATE: 1999-09-29 
; PRIOR APPLICATION NUMBER: 60/156,634 
; PRIOR FILING DATE: 1999-09-29 
; PRIOR APPLICATION NUMBER: 60/157,280 
; PRIOR FILING DATE: 1999-10-01 
; PRIOR APPLICATION NUMBER: 60/157,294 



; PRIOR FILING DATE: 1999-10-01
; PRIOR APPLICATION NUMBER: 60/157,281
; PRIOR FILING DATE: 1999-10-01
; PRIOR APPLICATION NUMBER: 60/157,293
; PRIOR FILING DATE: 1999-10-01
; PRIOR APPLICATION NUMBER: 60/157,282
; PRIOR FILING DATE: 1999-10-01
; NUMBER OF SEQ ID NOS: 74
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 36
; LENGTH: 334
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-875-076-36 

Query Match 75.1%; Score 1238.5; DB 12; Length 334; 

Best Local Similarity 72.2%; Pred. No. 1.5e-104; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

II I : I : I I I I II I I I I I I II III: I : I I I I I : I I : I : I I II I I I : II I I 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120

11:11 I I I I I II : I I : I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I : i I I I : : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

I I I I I II I II I I I I I I I I I I : I I I I I I : I I : I II I : I : I I : I II I : I : I 

Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240

II I I : I I I I I I I I I II I I I I I I I I : : I I I : I : : I II I I I I : I I I I : : I I I I I I : I 

Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

I I I I I : I I I : I I I I I I II I III I I 1 : I I I I I I I I I I I I : I I I I : I I I : I : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304

Qy 300 MLISKFRQYFKSLTSF 315 

I I ::: I I I I I I II 
Db 305 MLMNQLRHNFKSLTSF 320 



RESULT 8 

US-09-876-252-38 

; Sequence 38, Application US/09876252
; Publication No. US20030018182A1
; GENERAL INFORMATION:
; APPLICANT: Behan, Dominic P.
; APPLICANT: Lehmann-Bruinsma, Karin
; APPLICANT: Chalmers, Derek T.
; APPLICANT: Lowitz, Kevin P.
; APPLICANT: Lin, I-Lin
; APPLICANT: Dang, Huong T.
; APPLICANT: Chen, Ruoping
; APPLICANT: Liaw, Chen W.
; TITLE OF INVENTION: Non-Endogenous Constitively Activated Human G Protein
; TITLE OF INVENTION: Coupled Receptors

; FILE REFERENCE: AREN-0054 

; CURRENT APPLICATION NUMBER: US/09/876,252

; CURRENT FILING DATE: 2001-06-07

; PRIOR APPLICATION NUMBER: 09/416,760 

; PRIOR FILING DATE: 1999-10-12 

; PRIOR APPLICATION NUMBER: 09/170,496 

; PRIOR FILING DATE: 1998-10-13 

; PRIOR APPLICATION NUMBER: 60/110,060 

; PRIOR FILING DATE: 1998-11-27

; PRIOR APPLICATION NUMBER: 60/120,416 

; PRIOR FILING DATE: 1999-02-16 

; PRIOR APPLICATION NUMBER: 60/121,852

; PRIOR FILING DATE: 1999-02-26 

; PRIOR APPLICATION NUMBER: 60/109,213 

; PRIOR FILING DATE: 1998-11-20 

; PRIOR APPLICATION NUMBER: 60/123,944 

; PRIOR FILING DATE: 1999-03-12 

; PRIOR APPLICATION NUMBER: 60/123,945 

; PRIOR FILING DATE: 1999-03-12 

; PRIOR APPLICATION NUMBER: 60/123,948 

; PRIOR FILING DATE: 1999-03-12 

; PRIOR APPLICATION NUMBER: 60/123,951 

; PRIOR FILING DATE: 1999-03-12 

; PRIOR APPLICATION NUMBER: 60/123,946 

; PRIOR FILING DATE: 1999-03-12 

; PRIOR APPLICATION NUMBER: 60/123,949 

; PRIOR FILING DATE: 1999-03-12

; PRIOR APPLICATION NUMBER: 60/152,524 

; PRIOR FILING DATE: 1999-09-03 

; PRIOR APPLICATION NUMBER: 60/151,114 

; PRIOR FILING DATE: 1999-08-27 

; PRIOR APPLICATION NUMBER: 60/108,029 

; PRIOR FILING DATE: 1998-11-12 

; PRIOR APPLICATION NUMBER: 60/136,436 

; PRIOR FILING DATE: 1999-05-28 

; PRIOR APPLICATION NUMBER: 60/136,439 

; PRIOR FILING DATE: 1999-05-28 

; PRIOR APPLICATION NUMBER: 60/136,567 

; PRIOR FILING DATE: 1999-05-28 

; PRIOR APPLICATION NUMBER: 60/137,127 

; PRIOR FILING DATE: 1999-05-28

; PRIOR APPLICATION NUMBER: 60/137,131

; PRIOR FILING DATE: 1999-05-28 

; PRIOR APPLICATION NUMBER: 60/141,448 

; PRIOR FILING DATE: 1999-06-29 

; PRIOR APPLICATION NUMBER: 60/136,437 

; PRIOR FILING DATE: 1999-05-28 

; PRIOR APPLICATION NUMBER: 60/156,555 

; PRIOR FILING DATE: 1999-09-29 

; PRIOR APPLICATION NUMBER: 60/156,634 

; PRIOR FILING DATE: 1999-09-29 

; PRIOR APPLICATION NUMBER: 60/156,653 

; PRIOR FILING DATE: 1999-09-29 

; PRIOR APPLICATION NUMBER: 60/157,280 

; PRIOR FILING DATE: 1999-10-01 



; PRIOR APPLICATION NUMBER: 60/157,294 

; PRIOR FILING DATE: 1999-10-01 

; PRIOR APPLICATION NUMBER: 60/157,281 

; PRIOR FILING DATE: 1999-10-01 

; PRIOR APPLICATION NUMBER: 60/157,282 

; PRIOR FILING DATE: 1999-10-01 

; PRIOR APPLICATION NUMBER: 60/156,633 

; PRIOR FILING DATE: 1999-09-29 

; NUMBER OF SEQ ID NOS: 146
; SOFTWARE: PatentIn version 3.0
; SEQ ID NO 38
; LENGTH: 334
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-876-252-38 

Query Match 75.1%; Score 1238.5; DB 12; Length 334; 

Best Local Similarity 72.2%; Pred. No. 1.5e-104; 



Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1;


Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

M 1 :|:IMI II 1 Mill II IN: hill l|:||:| : 1 1 1 1 1 1 1 : 1 1 1 1
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120

11 = 11 1 1 M 1 1 1 : 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 II : :
Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 M 1 1 : 1 1 : 1 I I I : 1 : | : 1 1 I I : 1 : I
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240

1111:11111111111 1 1 1 1 1 1 II : : 1 1 1 : 1 : : 1 1 M 1 II : 1 1 II : : 1 1 1 1 1 1 : 1
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299

M 1 II : 1 1 1 : 1 1 1 1 1 1 1 1 1 | | 1 1 1 1 : II 1 1 1 1 1 1 1 1 II : 1 1 1 1 : 1 1 1 : 1 :
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304

Qy 300 MLISKFRQYFKSLTSF 315

1 1 ::: 1 1 II 1 1 1 1
Db 305 MLMNQLRHNFKSLTSF 320





RESULT 9 

US-10-272-983-36 

; Sequence 36, Application US/10272983 

; Publication No. US20030148450A1

; GENERAL INFORMATION: 

; APPLICANT: Chen, Ruoping 

; APPLICANT: Dang, Huong T. 

; APPLICANT: Liaw, Chen W. 

; APPLICANT: Lin, I-Lin 

; TITLE OF INVENTION: Human Orphan G Protein Coupled Receptors 
; FILE REFERENCE: AREN0050 

; CURRENT APPLICATION NUMBER: US/10/272,983



; CURRENT FILING DATE: 2002-10-17 

; PRIOR APPLICATION NUMBER: US/09/417,044
; PRIOR FILING DATE: 1999-10-12 
; PRIOR APPLICATION NUMBER: 60/109,213 
; PRIOR FILING DATE: 1998-11-20 

; PRIOR APPLICATION NUMBER: 60/120,416
; PRIOR FILING DATE: 1999-02-16 
; PRIOR APPLICATION NUMBER: 60/121,851 
; PRIOR FILING DATE: 1999-02-26 
; PRIOR APPLICATION NUMBER: 60/123,946 
; PRIOR FILING DATE: 1999-03-12 
; PRIOR APPLICATION NUMBER: 60/123,949 
; PRIOR FILING DATE: 1999-03-12 

; PRIOR APPLICATION NUMBER: 60/136,436
; PRIOR FILING DATE: 1999-05-28
; PRIOR APPLICATION NUMBER: 60/136,437
; PRIOR FILING DATE: 1999-05-28 
; PRIOR APPLICATION NUMBER: 60/136,439 
; PRIOR FILING DATE: 1999-05-28 
; PRIOR APPLICATION NUMBER: 60/136,567 
; PRIOR FILING DATE: 1999-05-28 

; Remaining Prior Application data removed - See File Wrapper or PALM. 

; NUMBER OF SEQ ID NOS: 74
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 36
; LENGTH: 334
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-272-983-36 

Query Match 75.1%; Score 1238.5; DB 14; Length 334; 

Best Local Similarity 72.2%; Pred. No. 1.5e-104; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

II I : I : I I I I M I I I I I I II Ml: hill I I : I I : I : I I II I I I : I I I I 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120

M : I I I II I I I I : I I : I I I I I I I I I I I I I I I I M I I I I M I I I II I I : M I I : : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180

I M I I I I I I I I I I I I I I I I I : I I I I I I : I | : | I I I : | : | | : | | | | : | : | 
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240

I I I I : I I I I I I I I I I I I I I I I I I I : : I I I : I : : I I II I I I : I I I I : : I I M I I : I 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

I I M I : I I I : I I I I I I II I Ml I I | : I I I | | | | | | I I I : I I I I : I I I : I : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304

Qy 300 MLISKFRQYFKSLTSF 315 

I I : : : I I I I I I I I 
Db 305 MLMNQLRHNFKSLTSF 320 






RESULT 10 
US-10-393-807-36 

; Sequence 36, Application US/10393807 

; Publication No. US20030175891A1 

; GENERAL INFORMATION: 

; APPLICANT: Chen, Ruoping 

; APPLICANT: Dang, Huong T. 

; APPLICANT: Liaw, Chen W. 

; APPLICANT: Lin, I-Lin 

; TITLE OF INVENTION: Human Orphan G Protein Coupled Receptors 

; FILE REFERENCE: AREN0050
; CURRENT APPLICATION NUMBER: US/10/393,807
; CURRENT FILING DATE: 2003-03-21 
; PRIOR APPLICATION NUMBER: US/09/417,044 

; PRIOR FILING DATE: 1999-10-12
; PRIOR APPLICATION NUMBER: 60/109,213 
; PRIOR FILING DATE: 1998-11-20 
; PRIOR APPLICATION NUMBER: 60/120,416 
; PRIOR FILING DATE: 1999-02-16 
; PRIOR APPLICATION NUMBER: 60/121,851 
; PRIOR FILING DATE: 1999-02-26 
; PRIOR APPLICATION NUMBER: 60/123,946 
; PRIOR FILING DATE: 1999-03-12 
; PRIOR APPLICATION NUMBER: 60/123,949 

; PRIOR FILING DATE: 1999-03-12
; PRIOR APPLICATION NUMBER: 60/136,436 
; PRIOR FILING DATE: 1999-05-28 
; PRIOR APPLICATION NUMBER: 60/136,437 

; PRIOR FILING DATE: 1999-05-28
; PRIOR APPLICATION NUMBER: 60/136,439 
; PRIOR FILING DATE: 1999-05-28 
; PRIOR APPLICATION NUMBER: 60/136,567 
; PRIOR FILING DATE: 1999-05-28 

; Remaining Prior Application data removed - See File Wrapper or PALM.
; NUMBER OF SEQ ID NOS: 74
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 36
; LENGTH: 334
; TYPE: PRT
; ORGANISM: Homo sapiens 
US-10-393-807-36 

Query Match 75.1%; Score 1238.5; DB 14; Length 334; 

Best Local Similarity 72.2%; Pred. No. 1.5e-104; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1;

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

II I :|:IMI M I I : hill l|:||:| : I I I I I I I : I I I I 

Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120

' 1 1 11 I 1 I I I t I I i I I I I I I I I I I • 1 I I I • •
Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180



11 iiiiii*! ii*i ii i . I - I l*l!ll*| : I 

Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240

I I M : I I I II I I I I II I I I I I I I I : : I I I : | : : | I I I I | | : | | | | : : | : | 

Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299

I M I I : I I I : I I I II I I I I III I I I : I I I I I I I I I I I I : I I I I : | | | : | : 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304

Qy 300 MLISKFRQYFKSLTSF 315 

I I : : : I I I I I I I I 
Db 305 MLMNQLRHNFKSLTSF 320 



RESULT 11 
US-10-417-820A-38 

; Sequence 38, Application US/10417820A
; Publication No. US20030229216A1
; GENERAL INFORMATION:
; APPLICANT: Chen, Ruoping
; APPLICANT: Liaw, Chen W.
; APPLICANT: Lowitz, Kevin
; APPLICANT: Chalmers, Derek T.
; APPLICANT: Behan, Dominic P.
; TITLE OF INVENTION: Constitutively Activated Human G Protein Coupled
; TITLE OF INVENTION: Receptors
; FILE REFERENCE: 7.US28.CON
; CURRENT APPLICATION NUMBER: US/10/417,820A
; CURRENT FILING DATE: 2003-04-16
; PRIOR APPLICATION NUMBER: 09/416,760
; PRIOR FILING DATE: 1999-10-12
; PRIOR APPLICATION NUMBER: 09/170,496
; PRIOR FILING DATE: 1998-10-13
; PRIOR APPLICATION NUMBER: 60/110,060
; PRIOR FILING DATE: 1998-11-27
; PRIOR APPLICATION NUMBER: 60/120,416
; PRIOR FILING DATE: 1999-02-16
; PRIOR APPLICATION NUMBER: 60/121,852
; PRIOR FILING DATE: 1999-02-26
; PRIOR APPLICATION NUMBER: 60/109,213
; PRIOR FILING DATE: 1998-11-20
; PRIOR APPLICATION NUMBER: 60/123,944
; PRIOR FILING DATE: 1999-03-12
; PRIOR APPLICATION NUMBER: 60/123,945
; PRIOR FILING DATE: 1999-03-12
; PRIOR APPLICATION NUMBER: 60/123,948
; PRIOR FILING DATE: 1999-03-12
; PRIOR APPLICATION NUMBER: 60/123,951
; PRIOR FILING DATE: 1999-03-12
; Remaining Prior Application data removed - See File Wrapper or PALM.
; NUMBER OF SEQ ID NOS: 155
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 38
; LENGTH: 334
; TYPE: PRT



; ORGANISM: Homo sapiens 
US-10-417-820A-38 



Query Match 75.1%; Score 1238.5; DB 15; Length 334; 

Best Local Similarity 72.2%; Pred. No. 1.5e-104; 



Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1;


Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

II 1 :|:| MIM M Ml: IMII ||:||:| :||IIIII:||M
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120

M M 1 II II II 1 : 1 1 : 1 I I I || || | || | M II II II 1 II II 1 1 II II M 1 II : :
Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180

M II M 1 II II 1 II M II II M 1 II 1 1 : 1 1 : 1 1 i I : | : | I : I 1 I I : | : |
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240

M M M II II II II II 1 1 : : 1 || : | : : | 1 1 1 1 I I : | | | | : : | | | | | | : |
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299

= M 1 = 1 1 II Ml 1 1 1 MIIMIMI 1 1 1 : 1 1 I 1 : ! I I : I :
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304

Qy 300 MLISKFRQYFKSLTSF 315

II : : : 1 II 1 II II
Db 305 MLMNQLRHNFKSLTSF 320





RESULT 12 

US-10-264-237-2757 

; Sequence 2757, Application US/10264237 

; Publication No. US20040009491A1

; GENERAL INFORMATION: 

; APPLICANT: Birse et al.

; TITLE OF INVENTION: Nucleic Acids, Proteins, and Antibodies 
; FILE REFERENCE: PA131P1 

; CURRENT APPLICATION NUMBER: US/10/264,237
; CURRENT FILING DATE: 2002-10-04
; PRIOR APPLICATION NUMBER: PCT/US01/16450
; PRIOR FILING DATE: 2001-05-18
; PRIOR APPLICATION NUMBER: US 60/205,515
; PRIOR FILING DATE: 2000-05-19
; NUMBER OF SEQ ID NOS: 2876
; SOFTWARE: PatentIn Ver. 3.1
; SEQ ID NO 2757
; LENGTH: 334
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-264-237-2757 

Query Match 75.1%; Score 1238.5; DB 15; Length 334; 

Best Local Similarity 72.2%; Pred. No. 1.5e-104; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 



Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

II 1 : 1 : 1 M 1 II 1 1 M 1 1 || IN: | : | I I | | : | | : | : | | | | | | | : | | | |
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120

1 1 : M 1 1 1 1 M 1 : M : 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 : 1 1 M : :
Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180

1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 I : I | | | | | : | | : | | | | : | : | | : | | | | : | : |
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240

1 l 1 1 : 1 1 ; , ! | ' MINI: : 1 1 1 : I : : I 1 II 1 1 1 : 1 I I I : : : |
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299

1 I.I M : 1 1 1 : I I I I I I II 1 IN | | | :||||MIM 1 1 1 : 1 1 1 1 : 1 1 I : I :
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304

Qy 300 MLISKFRQYFKSLTSF 315

1 1 : : : 1 1 1 1 II I I
Db 305 MLMNQLRHNFKSLTSF 320





RESULT 13 
US-10-723-955-38 

; Sequence 38, Application US/10723955
; Publication No. US20040110238A1
; GENERAL INFORMATION:
; APPLICANT: Behan, Dominic P.
; APPLICANT: Chalmers, Derek T.
; APPLICANT: Lin, I-Lin
; APPLICANT: Liaw, Chen W.
; APPLICANT: Lehman-Bruinsma, Karin
; APPLICANT: Lowitz, Kevin P.
; APPLICANT: Dang, Huong T.
; APPLICANT: Chen, Ruoping
; APPLICANT: Gore, Martin
; APPLICANT: White, Carol
; TITLE OF INVENTION: Constitutively Activated Human G Protein Coupled
; TITLE OF INVENTION: Receptors
; FILE REFERENCE: 7.US29.CON
; CURRENT APPLICATION NUMBER: US/10/723,955
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: 10/417,820
; PRIOR FILING DATE: 2003-4-16
; PRIOR APPLICATION NUMBER: 09/416,760
; PRIOR FILING DATE: 1999-10-12
; PRIOR APPLICATION NUMBER: 09/170,496
; PRIOR FILING DATE: 1998-10-13
; PRIOR APPLICATION NUMBER: 60/110,060
; PRIOR FILING DATE: 1998-11-27
; PRIOR APPLICATION NUMBER: 60/120,416
; PRIOR FILING DATE: 1999-02-16
; PRIOR APPLICATION NUMBER: 60/121,852



; PRIOR FILING DATE: 1999-02-26
; PRIOR APPLICATION NUMBER: 60/109,213 
; PRIOR FILING DATE: 1998-11-20 
; PRIOR APPLICATION NUMBER: 60/123,944 
; PRIOR FILING DATE: 1999-03-12 
; PRIOR APPLICATION NUMBER: 60/123,945 
; PRIOR FILING DATE: 1999-03-12 
; PRIOR APPLICATION NUMBER: 60/123,948 
; PRIOR FILING DATE: 1999-03-12
; Remaining Prior Application data removed - See File Wrapper or PALM.
; NUMBER OF SEQ ID NOS: 148
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 38
; LENGTH: 334
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-723-955-38 



Query Match 75.1%; Score 1238.5; DB 16; Length 334; 

Best Local Similarity 72.2%; Pred. No. 1.5e-104; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1;



Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

II 1 = 1 = 1111 II 1 Mill II III: I : I I | 1 1 : I 1 : | : || 1 1 1 1 1 : I I 1 1
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120

f 1 = 1 1 1 1 1 1 1 1 1 : I I : | | | | 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 : 1 1 1 1 : :
Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180

i 1 1 1 i 1 1 I | | | | | | | M | || : | M | | | : || : | | | | : | . , | : | | | | : | ; |
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240

1 1 1 1 : 1 1 1 1 M 1 1 1 II Illlllll: :|||:|::| |:|| ||::|MII|:|
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299

Mllhllhllllll || I Ml I I 1 = 1 1 1 1 1 1 F 1 I IM:|IM:|||:|:
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304

Qy 300 MLISKFRQYFKSLTSF 315

II : : : 1 1 1 II 1 1 1
Db 305 MLMNQLRHNFKSLTSF 320





RESULT 14 
US-10-782-596-36 

; Sequence 36, Application US/10782596 

; Publication No. US20040137509A1 

; GENERAL INFORMATION: 

; APPLICANT: Chen, Ruoping 

; APPLICANT: Dang, Huong T. 

; APPLICANT: Liaw, Chen W. 

; APPLICANT: Lin, I-Lin 

; TITLE OF INVENTION: Human Orphan G Protein Coupled Receptors 



; FILE REFERENCE: AREN0050 

; CURRENT APPLICATION NUMBER: US/10/782,596

; CURRENT FILING DATE: 2004-02-19

; PRIOR APPLICATION NUMBER: US/09/875,076

; PRIOR FILING DATE: 2001-06-06 

; PRIOR APPLICATION NUMBER: 09/417,044 

; PRIOR FILING DATE: 1999-10-12 

; PRIOR APPLICATION NUMBER: 60/120,416 

; PRIOR FILING DATE: 1999-02-16 

; PRIOR APPLICATION NUMBER: 60/121,851 

; PRIOR FILING DATE: 1999-02-26 

; PRIOR APPLICATION NUMBER: 60/123,946 

; PRIOR FILING DATE: 1999-03-12 

; PRIOR APPLICATION NUMBER: 60/123,949 

; PRIOR FILING DATE: 1999-03-12 

; PRIOR APPLICATION NUMBER: 60/136,436 

; PRIOR FILING DATE: 1999-05-28
; PRIOR APPLICATION NUMBER: 60/136,437 
; PRIOR FILING DATE: 1999-05-28 
; PRIOR APPLICATION NUMBER: 60/136,439 
; PRIOR FILING DATE: 1999-05-28 
; PRIOR APPLICATION NUMBER: 60/136,567 
; PRIOR FILING DATE: 1999-05-28 

; Remaining Prior Application data removed - See File Wrapper or PALM.
; NUMBER OF SEQ ID NOS: 74
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 36
; LENGTH: 334
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-782-596-36 



Query Match 75.1%; Score 1238.5; DB 16; Length 334; 

Best Local Similarity 72.2%; Pred. No. 1.5e-104; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 



Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

II 1 : 1 : 1 1 1 1 M 1 1 1 1 1 1 1 1 III: 1 : 1 1 1 I 1 : 1 1 : 1 : I M 1 1 II : 1 1 1 1
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120

1 1 : 1 1 1 1 1 M 1 1 : 1 1 : 1 1 1 1 | | | M II II 1 II II 1 I I I | | | | M 1 1 1 : 1 1 1 1 : :
Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180

M 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 : 1 1 1 I I I : I | : | Ml : 1 : I I : I I | | : I : |
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240

M M : M 1 1 1 1 1 1 1 I I II : : || I : | : : | | | | | | | : | | | | : s : ,
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299

1 1 t 1 1 r | | | r I t f 1 1 1 III III ll|:||||||lll 1 II : II 1 1 : I 1 1 : I :
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304

Qy 300 MLISKFRQYFKSLTSF 315

Db 305 MLMNQLRHNFKSLTSF 320



RESULT 15 
US-10-311-671-3 

Sequence 3, Application US/10311671 
Publication No. US20040072996A1
GENERAL INFORMATION: 
APPLICANT: INCYTE GENOMICS, INC. 
APPLICANT: LAL, Preeti G. 
APPLICANT: BAUGHN, Mariah R. 
APPLICANT: HAFALIA, April J. A.
APPLICANT: NGUYEN, Danniel B. 
APPLICANT: GANDHI, Ameena R. 
APPLICANT: KALLICK, Deborah A. 
APPLICANT: GRIFFIN, Jennifer A. 
APPLICANT: YUE, Henry 
APPLICANT: KHAN, Farrah A. 
APPLICANT: ARVIZU, Chandra S. 
APPLICANT: LU, Dyung Aina M. 
APPLICANT: TRIBOULEY, Catherine M. 
APPLICANT: LU, Yan 
APPLICANT: CHAWLA, Narinder K.
APPLICANT: GRAUL, Richard 
APPLICANT: YAO, Monique G. 
APPLICANT: YANG, Junming 
APPLICANT: RAMKUMAR, Jayalaxmi 
APPLICANT: AU-YOUNG, Janice K. 
APPLICANT: ELLIOTT, Vicki S. 
APPLICANT: HERNANDEZ, Roberto
APPLICANT: WALSH, Roderick T. 
APPLICANT: BOROWSKY, Mark L. 
APPLICANT: THORNTON, Michael B. 
APPLICANT: HE, Ann 

TITLE OF INVENTION: G-PROTEIN COUPLED RECEPTORS 
FILE REFERENCE: PI-0131 USN 

CURRENT APPLICATION NUMBER: US/10/311,671 
CURRENT FILING DATE: 2002-12-16 
PRIOR APPLICATION NUMBER: PCT/US01/19275 
PRIOR FILING DATE: 2001-06-15 
PRIOR APPLICATION NUMBER: 60/212,483 
PRIOR FILING DATE: 2000-06-16 
PRIOR APPLICATION NUMBER: 60/213,954 
PRIOR FILING DATE: 2000-06-22 
PRIOR APPLICATION NUMBER: 60/215,209 
PRIOR FILING DATE: 2000-06-29 
PRIOR APPLICATION NUMBER: 60/216,595 
PRIOR FILING DATE: 2000-07-07 
PRIOR APPLICATION NUMBER: 60/218,936 
PRIOR FILING DATE: 2000-07-14 
PRIOR APPLICATION NUMBER: 60/219,154 
PRIOR FILING DATE: 2000-07-19 
PRIOR APPLICATION NUMBER: 60/220,141 
PRIOR FILING DATE: 2000-07-21 
NUMBER OF SEQ ID NOS: 35
SOFTWARE: PERL Program 



; SEQ ID NO 3 
LENGTH: 379 
TYPE: PRT
ORGANISM: Homo sapiens
FEATURE:
NAME/KEY: misc_feature
; OTHER INFORMATION: Incyte ID No: 3485895CD1 
US-10-311-671-3 



Query Match 75.1%; Score 1238.5; DB 12; Length 379; 

Best Local Similarity 72.2%; Pred. No. 1.8e-104; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1;

Qy   1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60
       || | :|:|||| || | ||||| || |||: |:|||  ||:||:| :|||||||:||||
Db  50 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 109

Qy  61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120
       ||:|| |||||||:||:||||    |||||||||||||| |||||||||||||:||||::
Db 110 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 169

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180
       ||||||| |||||||||||||:| |||||:||:|  || |  : |: | |:||||:| :|
Db 170 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 229

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMVVFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240
       ||||:||||||||||| ||||||||: :|||:|::| ||||||:||  ||::||||||:|
Db 230 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 289

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299
       |||||:|||:|||||| || |  |||  | | | :||||||||| |||:||||:|||:|:
Db 290 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 349

Qy 300 MLISKFRQYFKSLTSF 315
       ||::: |  |||||||
Db 350 MLMNQLRHNFKSLTSF 365



Search completed: August 23, 2004, 17:14:30 
Job time : 127 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: August 23, 2004, 16:55:03 ; Search time 115 Seconds 

(without alignments) 
869.733 Million cell updates/sec 

Title: US-09-891-138A-2

Perfect score: 1650 

Sequence: 1 MAQNLSCENWLATEAILNKY..........REMLISKFRQYFKSLTSFRT 317

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1017041 seqs, 315518202 residues 

Total number of hits satisfying chosen parameters: 1017041 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SPTREMBL_25:*
1: sp_archea:*
2: sp_bacteria:*
3: sp_fungi:*
4: sp_human:*
5: sp_invertebrate:*
6: sp_mammal:*
7: sp_mhc:*
8: sp_organelle:*
9: sp_phage:*
10: sp_plant:*
11: sp_rodent:*
12: sp_virus:*
13: sp_vertebrate:*
14: sp_unclassified:*
15: sp_rvirus:*
16: sp_bacteriap:*
17: sp_archeap:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
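
The percentage figures printed with each hit can be checked against the header values. The sketch below is an assumption inferred from the numbers in this report, not taken from GenCore documentation: it treats "Query Match" as the hit score divided by the perfect score (1650 for this query) and "Best Local Similarity" as identical positions divided by all aligned columns (Matches + Conservative + Mismatches + Indels). The function names are hypothetical.

```python
# Assumed formulas, inferred from the figures in this report.
PERFECT_SCORE = 1650.0  # "Perfect score" from the search header


def query_match_pct(score, perfect_score=PERFECT_SCORE):
    """Hit score as a percentage of the query's perfect (self) score."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_pct(matches, conservative, mismatches, indels):
    """Identical positions as a percentage of all aligned columns."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


if __name__ == "__main__":
    # Human GPR91 hits: Score 1238.5; Matches 228, Conservative 42,
    # Mismatches 45, Indels 1.
    print(query_match_pct(1238.5))
    print(best_local_similarity_pct(228, 42, 45, 1))
```

With the values listed for the human GPR91 hits, these formulas give 75.1 and 72.2, which agree with the "Query Match" and "Best Local Similarity" lines printed for those results.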
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Q99mt6 mus musculu 
Q9bxa5 homo sapien 
Q8tdq8 homo sapien 
Q90x57 xenopus lae 
Q8bmj5 mus musculu 
Q9de05 raja erinac 
O57466 meleagris g
Q7zza4 brachydanio 
Q7zwq7 xenopus lae 
Q8vhp3 cavia porce 
Q8mjw8 canis famil 

Q9erk9 mus musculu 
Q9myj8 callithrix 

Q9jly8 rattus norv 
Q86va9 homo sapien 

Q8cbj0 mus musculu 
Q9myj9 oryctolagus 
Q8br50 mus musculu 
Q91zh4 rattus norv 
Q7t0x4 xenopus lae 
Q7z3s6 homo sapien 

Q8r528 mus musculu 
Q8hzn7 gorilla gor 
Q951hl macaca fasc 
Q8bq97 mus musculu 
Q9pvg0 carassius a 
Q8qfr6 xenopus lae 
Q9byx5 homo sapien 
Q7z3s0 homo sapien 
Q8bvw4 mus musculu 
Q8bmh9 mus musculu 
Q9pvf9 carassius a 
Q8iuz1 homo sapien
Q8hzn8 pan troglod 

Q8qgq4 carassius a 
Q8n5s7 homo sapien 

Q8bkk1 mus musculu
Q8hzn3 papio hamad 
Q8hzn4 cercopithec 
Q7tmv7 mus musculu 
Q7tt86 rattus norv 
Q8axm7 carassius a 
Q8nel0 homo sapien 

Q8byi1 mus musculu
Q8hzn6 pongo pygma 



ALIGNMENTS 



RESULT 1 
Q99MT6 

ID Q99MT6 PRELIMINARY; PRT; 317 AA.

AC Q99MT6; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 



DE G-protein coupled receptor GPR91. 

GN GPR91. 

OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL; 

RX MEDLINE=21172992; PubMed=11273702;
RA Wittenberger T., Schaller H.C., Hellebrand S.;

RT "An expressed sequence tag (est) data mining strategy succeeding in 



RT the discovery of new g-protein coupled receptors."; 
RL J. Mol. Biol. 307:799-813(2001). 



CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN (BY SIMILARITY). 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS.
DR EMBL; AF295367; AAK01867.1; -.
DR MGD; MGI:1934135; Gpr91.
DR GO; GO:0016021; C:integral to membrane; IEA.
DR GO; GO:0004872; F:receptor activity; IEA.
DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.
DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Receptor; Transmembrane.
SQ SEQUENCE 317 AA; 36701 MW; 4E22F0608F928E36 CRC64;

Query Match 99.0%; Score 1634; DB 11; Length 317; 

Best Local Similarity 99.1%; Pred. No. 8.8e-133; 

Matches 314; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 

Qy   1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60

Qy  61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120
       |||||||||||||||||||||||||||||||||||||||||||||:| || |||||||||
Db  61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSMLLLTVISMDRYLLM 120

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMVVFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 LIYSLCLTLLGFLIPLSVMCFFYYKMVVFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240

Qy 241 FTPYHIMRNLRIASRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREM 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 FTPYHIMRNLRIASRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREM 300

Qy 301 LISKFRQYFKSLTSFRT 317
       |||||||||||||||||
Db 301 LISKFRQYFKSLTSFRT 317



RESULT 2 
Q9BXA5 



ID Q9BXA5 PRELIMINARY; PRT; 330 AA. 

AC Q9BXA5; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE G-protein coupled receptor 91. 

GN GPR91. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21172992; PubMed=11273702;

RA Wittenberger T., Schaller H.C., Hellebrand S.; 

RT "An expressed sequence tag (est) data mining strategy succeeding in 

RT the discovery of new g-protein coupled receptors."; 

RL J. Mol. Biol. 307:799-813(2001). 

CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN (BY SIMILARITY). 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

DR EMBL; AF348078; AAK29080.1; -. 

DR GO; GO: 0016021; C:integral to membrane; NAS . 

DR GO; GO: 0016526; F: G-protein coupled receptor activity, unknow. . .; NAS. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW G-protein coupled receptor; Receptor; Transmembrane. 

SQ SEQUENCE 330 AA; 38283 MW; 6D3C70E340866BAB CRC64 ; 



Query Match 75.1%; Score 1238.5; DB 4; Length 330; 

Best Local Similarity 72.2%; Pred. No. 1.1e-98;

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy   1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60
       || | :|:|||| || | ||||| || |||: |:|||  ||:||:| :|||||||:||||
Db   1 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 60

Qy  61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120
       ||:|| |||||||:||:||||    |||||||||||||| |||||||||||||:||||::
Db  61 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 120

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180
       ||||||| |||||||||||||:| |||||:||:|  || |  : |: | |:||||:| :|
Db 121 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 180

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMVVFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240
       ||||:||||||||||| ||||||||: :|||:|::| ||||||:||  ||::||||||:|
Db 181 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 240

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299
       |||||:|||:|||||| || |  |||  | | | :||||||||| |||:||||:|||:|:
Db 241 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 300

Qy 300 MLISKFRQYFKSLTSF 315
       ||::: |  |||||||
Db 301 MLMNQLRHNFKSLTSF 316



RESULT 3 
Q8TDQ8 

ID Q8TDQ8 PRELIMINARY; PRT; 334 AA. 

AC Q8TDQ8; 

DT 01-JUN-2002 (TrEMBLrel . 21, Created) 
DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE P2Y purinoceptor 1 (G protein-coupled receptor 91) . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang W., Li N., Wan T., Cao X.; 

RT "Human P2Y purinoceptor 1."; 

RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases. 
RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Kidney; 

RA Strausberg R. ; 

RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases. 

CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN (BY SIMILARITY). 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

DR EMBL; AF247785; AAL95690.1; -. 

DR EMBL; BC030948; AAH30948.1; -. 

DR Genew; HGNC:4542; GPR91. 

DR GO; GO: 0016021; C:integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F : rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW G-protein coupled receptor; Receptor; Transmembrane. 

SQ SEQUENCE 334 AA; 38697 MW; 33146E1AD87F0E81 CRC64; 

Query Match 75.1%; Score 1238.5; DB 4; Length 334; 

Best Local Similarity 72.2%; Pred. No. 1.1e-98;

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy   1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFN 60
       || | :|:|||| || | ||||| || |||: |:|||  ||:||:| :|||||||:||||
Db   5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFN 64

Qy  61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120
       ||:|| |||||||:||:||||    |||||||||||||| |||||||||||||:||||::
Db  65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180
       ||||||| |||||||||||||:| |||||:||:|  || |  : |: | |:||||:| :|
Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMVVFLKRRSQQQATALPLDKPQRLVVLAVVIFSIL 240
       ||||:||||||||||| ||||||||: :|||:|::| ||||||:||  ||::||||||:|
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299
       |||||:|||:|||||| || |  |||  | | | :||||||||| |||:||||:|||:|:
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304

Qy 300 MLISKFRQYFKSLTSF 315
       ||::: |  |||||||
Db 305 MLMNQLRHNFKSLTSF 320

RESULT 4 
Q90X57 



ID Q90X57 PRELIMINARY; PRT; 361 AA. 

AC Q90X57; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel . 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE P2Y1 nucleotide receptor. 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8355;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Cheng A.W., Tsim K.W. ; 

RT "Cloning of Xenopus P2Y1 Receptor."; 

RL Submitted (OCT-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF432354; AAL27614.1; -. 

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW Receptor. 

SQ SEQUENCE 361 AA; 41002 MW; E5B2D605F5B57FED CRC64; 



Query Match 29.2%; Score 482.5; DB 13; Length 361; 

Best Local Similarity 36.7%; Pred. No. 1.6e-33; 

Matches 114; Conservative 58; Mismatches 124; Indels 15; Gaps 8; 

Qy 1 MAQNLSCENWLATEAI LNK YYLSAFYAI EFI FGLLGNVT WFGYLFCMKNWNS SN 55 

•I II MM Mill: I I M I :: ::| || |:| : 

Db 18 LAS GS SAGN- -VTKCSLTKTGFQFYYLPAVYI WCITGFI GNS VAIWMFI FHMKPWS SIS 75 



Qy  56 ...
Db  76 ...

Qy 115 ...
Db 136 ...

Qy 174 ...
Db 196 ...

Qy 233 ...
Db 253 ...

Qy 291 ...
Db 313 ...



RESULT 5
Q8BMJ5

ID Q8BMJ5 PRELIMINARY; PRT; 373 AA.
AC Q8BMJ5;
DT 01-MAR-2003 (TrEMBLrel. 23, Created)
DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE P2Y purinoceptor 1.
GN P2RY1.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=C57BL/6J; TISSUE=Body;
RX MEDLINE=22354683; PubMed=12466851;
RA The FANTOM Consortium,
RA the RIKEN Genome Exploration Research Group Phase I & II Team;
RT "Analysis of the mouse transcriptome based on functional annotation of
RT 60,770 full-length cDNAs.";
RL Nature 420:563-573(2002).
DR EMBL; AK030759; BAC27125.1; -.
DR MGD; MGI:105049; P2ry1.
DR GO; GO:0016021; C:integral to membrane; IEA.
DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.
DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
SQ SEQUENCE 373 AA; 42228 MW; BA88124B7847287C CRC64;



Query Match 29.0%; Score 479; DB 11; Length 373; 

Best Local Similarity 37.6%; Pred. No. 3.2e-33; 



Matches 108; Conservative 54; Mismatches 117; Indels 8; Gaps 7;



Qy  20 YYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79
       1 1 1 1 I : 1 1 I I | | : : 1 : 1 1 | 1 : : I I : 1 1 1 : : : 1 I : : | | | | | |
Db  52 YYLPAVYILVFIIGFLGNSVAIWMYVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111

Qy  80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138
       1 = 1 1 : 1 1 : : 1 1 1 1 1 1 1 1 1 1 1 1 II : II : | | : | | :
Db 112 FNKTDWIFGDAMCKLQRFIFHVNLYGSILFLTCISAHRYSGVVYPLKSLGRLKKKNAIYV 171

Qy 139 SLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197
       i : 1 1 : 1 : : 1 : 1 : : 1 : II 1 : 1 1 1 : I I : 1 III
Db 172 SVLVWLIVVVAISPILFYSGTGTRKNKTVTCYDTTSNDYLRSYFIYSMCTTVAMFCIPL- 230

Qy 198 VMCFFYYKMVVFLKRRSQQQATALPL-DKPQRLVVLAVVIFSILFTPYHIMRNLRIASRL 256
       1 : 1 : : 1 1 II 1 1 1 : : : : 1 : : : | : | : | : : : : | |
Db 231 VLILGCYGLIV--KALIYNDLDNSPLRRKSIYLVIIVLTVFAVSYIPFHVMKTMNLRARL 288

Qy 257 D-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 301
       1 I : 1 : : 1 : 1 1 1 1 1 1 1 : : 1 1 1 1 1 I I : I 1
Db 289 DFQTPEMCDFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 335





RESULT 6 
Q9DE05 

ID Q9DE05 PRELIMINARY; PRT; 357 AA. 

AC Q9DE05; 

DT 01-MAR-2001 (TrEMBLrel. 16, Created) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE P2Y receptor. 

OS Raja erinacea (Little skate) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes ; 

OC Elasmobranchii; Squalea; Hypnosqualea; Pristiora j ea; Batoidea; 

OC Rajiformes; Rajidae; Raja. 

OX NCBI_TaxID=7782; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Liver; 

RX MEDLINE=20459151; PubMed=10900200;

RA Dranoff J. A. , O'Neill A.F., Franco A.M., Cai S.Y., Connolly G.C., 

RA Ballatori N., Boyer J.L., Nathanson M.H.; 

RT "A primitive ATP receptor from the little skate Raja erinacea."; 

RL J. Biol. Chem. 275:30701-30706(2000).

DR EMBL; AF242850; AAG42684.1; -. 

DR GO; GO: 0016021; C:integral to membrane; IEA. 

DR GO; GO:0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P : G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW Receptor.
SQ SEQUENCE 357 AA; 41239 MW; 14604EE15DCBDB41 CRC64;



Query Match 27.9%; Score 461; DB 13; Length 357; 

Best Local Similarity 35.0%; Pred. No. 1.1e-31;

Matches 110; Conservative 64; Mismatches 112; Indels 28; Gaps 11;

QY 17 LNK- YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTL 72 

Ml Ml I I I : I : I I : : : : I I : | : | : | : | | | : : | : : : | 

Db 29 LNKGFQFYYLP IMYI I VFVTGFI GNSVALWMFI FHMRPWS S ITI YMFNLVLADLFYVFS L 88 

73 PI LI KS YANDKG-T YGDVLCI SNRYVLHTNLYT S ILFLTFI SMDRYLLMKYP FREHFLQK 131 
IMI II :|::U I I I I I I I I II I I I : | | : :| : | 

Db 89 PILIFYYFNKTDWIFGELLCKLXRFIFHVNLYGSILFLTCISVHRYTGWHPMKSLGRLK 148 

Qy 132 KEFAILISLAVW7VLVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLL 190 

I : I : : : I I I : I : I I : : : : : | | | | | | | : | 

Db 149 KKSATIVCVCVWITVMAGISPILYFSRTGLRRNKTNTCYDTTSKELLETYFI YSMSTTFF 208 

Qy 191 GFLIPLS VMCF-FYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIM 247 

llll: : : I : I I I I I I I I I : : : : | : | : | : | : | 

Db 209 GFCIPFATILVCYGFIVKALI SNDMKT PL - RGKS VRLVI I VLAVFAI S YL P FHVM 262 

Qy 248 RNLRIASRLDSWPQG— CT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLIS 303 

=11 : III : II I : : : I :M II III ::|| Ml || :| 
Db 263 KNLNLQSRL--YYQGLDTCEWNRRVYATYQVTRGLASLNSCVDPILYFLAGDTFR 315 

Qy 304 KFRQYFKSLTSFRT 317 

I:: : : I I 
Db 316 . — RRFTNAASRFMT 327 



RESULT 7 
O57466

ID O57466 PRELIMINARY; PRT; 374 AA.

AC O57466;

DT 01-JUN-1998 (TrEMBLrel . 06, Created) 

DT 01-JUN-1998 (TrEMBLrel. 06, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE G protein coupled P2Y nucleotide receptor. 

GN TP2Y. 

OS Meleagris gallopavo (Common turkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Meleagris. 

OX NCBI_TaxID=9103; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Blood; 

RX MEDLINE=98086419; PubMed=9415702;

RA Boyer J.L., Waldo G.L., Harden T.K.; 

RT "Molecular cloning and expression of an avian G protein-coupled P2Y 

RT receptor."; 

RL Mol. Pharmacol. 52:928-934(1997). 

DR EMBL; AF031897; AAC60339.1; 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F : rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm 1; 1. 



DR PRINTS; PR00237; GPCRRHODOPSN. 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS 5 02 62; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 374 AA; 42594 MW; 849C465722BDD02B CRC64;

Query Match 25.6%; Score 422.5; DB 13; Length 374; 

Best Local Similarity 33.7%; Pred. No. 2.4e-28; 

Matches 106; Conservative 59; Mismatches 119; Indels 31; Gaps 3 

Q y 10 WL ATEA I LNKYY LSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVY 57 

II I II : |: : | I I |: II | : : : : | : | | : : I 

Db 20 WLGGNTTAAAEAKCVFNEEFKFILLPISYGIVFWGLPLNSWAMWIFVSRMRPWNATTTY 79 

^ 58 LFNLSISDFAFLCTLPILIKSYAN-DKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDR 116 

: I I I : I I I : = : I I I : I I : : : I I I I : : : | | | : | | | | | | | | : | 
Db 90 MFNLAI SDTL WFSLPTL VYYYADRNNWPFGKVFCKI VRFLFYANLYS S I LFLTCI SVHR 139 

117 YLLMKYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGN 176 
I = = : I I I : I I I : I I : I I : : : I | I : : | : : | | 

Db 140 YMGICHPIRSLKWVKTKHARLICVGWLVVTICLIPNLIFVTTSSKDNSTLCHDTTKPEE 199 

QY 177 PEHNLIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQ 227 

• I : I I : I I I I I : I I | : | | | | : 

Db 2 00 FDHYVHYS S S IMALLFGI PFLVI WCYCLMAKRLCKRS FPSPSPRVPSYKKRSI 253 

QY 228 RLWLAWIFSILFTPYHIMRNLRIASRLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAIN 286 

::::::: I : I I I : I I I I II : | | | | : | | | | | : | | : : 
Db 254 KMI 1 1 VLTVFAICFVPFHITRTLYYTSRY — FQADCQTLNI INFTYKITRPLASINSCLD 311 

Qy 2 87 PIFYFLMGDHYREML 301 

lllhllll I 
Db 312 PILYFMAGDKYRGRL 326 



RESULT 8 
Q7ZZA4 

ID Q7ZZA4 PRELIMINARY; PRT; 347 AA. 

AC Q7ZZA4; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE SI:bZ46J2.9 (Novel protein similar to nucleotide receptors). 

GN SI:BZ46J2.9. 

OS Brachydanio rerio (Zebrafish) (Danio rerio) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi ; Cyprinif ormes ; 

OC Cyprinidae ; Danio . 

OX NCBI_TaxID=7955;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Skuce C. ; 

RL Submitted (FEB-2003) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AL590151; CAD68067.1; -. 

DR GO; GO: 0016021; C:integral to membrane; IEA. 

DR GO; GO: 0004872; F:receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 



DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN__RECEP_Fl_l ; 1. 

DR PROSITE; PS 5 02 62; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 347 AA; 39861 MW; 3D3C01F83CC283E2 CRC64;



Query Match 25.2%; Score 416; DB 13; Length 347; 

Best Local Similarity 35.7%; Pred. No. 8.1e-28; 

Matches 105; Conservative 57; Mismatches 116; Indels 16; Gaps 9; 



Qy  19 KY-YLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIK 77
       II 1 1 : : 1 1 1 : 1 : : : : 1 : I I I I : I : | : : | | : : : | | : | |
Db  19 KYILLPVSYSLVCFFGLILNSVALWMFITKMRPWKPSTVYMFHLALSDTLYVLSLPMLIY 78

Qy  78 SYAN-DKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAI 136
       IN =1111 1 : : : 1 1 1 1 1 1 1 1 1 1 1 : 1 | | : : | | | | |
Db  79 YYANRSPiWPFGWLCKIVRFLFYANLYCSILFLTCISVHRYLGICHPIRSLTLIKPRHAH 138

Qy 137 LISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPE--HNLI-YSLCLTLLGFL 193
       • : M 1 : : 1 1 : 1 : : 1 1 : II 1 1 : I : : : I | :
Db 139 MVCGFVWTAVIACLVPTLILVNTSRNGNSTLCHD TSRPEEFHNFVTYNSWMVLLFI 195

Qy 194 IPLSVMCFFYYKMVVFL----KRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMRN 249
       : 1 1 : 1 1 1 1 : 1 1 : : 1 : 1 : : : : I : I : | | | : | | |
Db 196 LPFLVIWCYCLMARALCQPRKGLAQNQQSS-SRKKSIKLIIWLWFAICFVPFHITRT 254

Qy 250 LRIASRLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLI 302
       1 II' : M : 1 : 1 1 1 M : 1 1 :: 1 1 1 1 1 1 1 I I I I :
Db 255 LYYAYRI--FDADCKTLNIVNFSYKITRPLASVNSCLDPILYFLAGDHYRSKLL 306





RESULT 9 
Q7ZWQ7 

ID Q7ZWQ7 PRELIMINARY; PRT; 537 AA. 

AC Q7ZWQ7; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8355; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo;

RA Klein S., Strausberg R. ; 

RL Submitted (FEB-2003) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BC046837; AAH46837.1; 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 



DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS 00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS502 62; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 537 AA; 61981 MW; B030F67D76CB9622 CRC64 ; 

Query Match 23.4%; Score 386.5; DB 13; Length 537; 

Best Local Similarity 35.3%; Pred. No. 4.3e-25; 

Matches 104; Conservative 51; Mismatches 125; Indels 15; Gaps 7; 

Qy  22 LSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSYAN 81
       I i : 1:11 I : : : : : I : I I : I I : I I I : : I I : : : | | | : | |
Db  46 LPVSYSAVFMVGLPLNIAAMWIFIAKMRPWNPTTVYMFNLALSDTLYVLSLPTLVYYYA- 104

Qy  82 DKGT--YGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILIS 139
       M : I : I I I I : : : I I I : I I i I I I I I : I I : : | : I : I
Db 105 DKNNWPFGEVLCKLVRFLFYANLYSSILFLTCISVHRYRGVCHPITSLRRMNAKHAYVIC 164

Qy 140 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199
       I I III ::| I I: II : : I I : : || : | | || ::
Db 165 ALWLS VTLCLVPNLI FVTVS PKVKNTI CHDTTRPEDFARYVEYSTAIMCLLFGI PCLI I 224

Qy 200 CFFYYKMVVFLKRR--SQQQATALPLDKPQ--RLVVLAVVIFSILFTPYHIMRNLRIASR 255
       I I I : I I I II I : : :: :: |:| | |:|| | | -|
Db 225 AGCYGLMTRELMKPIVSGNQQT-LPSYKKRSIKTIIFVMIAFAICFMPFHITRTLYYYAR 283

Qy 256 LDSWPQGCTQKAIKSI YTLTRPLAFLNSAINPIFYFLMGDHYREMLI SKFRQ 307
       I I I : I I =11111 I I I : I I I || IN | | | :
Db 284 L LGI KC YALNVINVTYKVTRPLASANSCI DPI LYFLANDRYRRRLI RTVRR 334



RESULT 10 
Q8VHP3 

ID Q8VHP3 PRELIMINARY; PRT; 361 AA. 

AC Q8VHP3; 

DT 01-MAR-2002 (TrEMBLrel. 20, Created) 

DT 01-MAR-2002 (TrEMBLrel. 20, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC-chemokine receptor 4. 

OS Cavia porcellus (Guinea pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Hystricognathi ; Caviidae; Cavia. 

OX NCBI_TaxID=10141;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Jopling L.A. , Sabroe I., Andrew D.P., Mitchell T.J., Li Y., 

RA Hodge M.R., Williams T.J., Pease J.E.; 

RT "The identification, characterization and distribution of guinea pig 

RT CCR4 and epitope mapping of a blocking antibody."; 

RL Submitted (OCT-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF431971; AAL57488.1; -. 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 



DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 361 AA; 41064 MW; 9304E897D4FD6839 CRC64;



Query Match 21.4%; Score 353; DB 11; Length 361; 

Best Local Similarity 30.0%; Pred. No. 2.2e-22; 

Matches 86; Conservative 62; Mismatches 131; Indels 8; Gaps 5; 



Qy  21 YLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNS-SNVYLFNLSISDFAFLCTLPILIKSY 79
       : 1 1 : : 1 : 1 1 1 1 1 1 | | | | | I : : I I | | | : | | | | : : | | |
Db  42 £ urrui olivt uc ^ v v v jj— vht J\Y KRLR5M1 DVYLLNLAI SDLLFVLSLPFW-GYY 99

Qy  80 ANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILIS 139
       i 1 : : 1 : 1 : : 1 : I I : : 1 : I I I I : : : : : : : |
Db 100 AADQWVFGLGVCKMISWIYLVGFYSGIFFIVLMSIDRYLAIVHGVFSMRVRTFTYGVITS 159

Qy 140 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199
       IIM= II 1 1 : ::| |: : : : 1 1 : : I I : | | | : |
Db 160 IATWAVAVFASLPGLLFSTCYTERNHTSCKTRYSANSTTWKVLSSLEINILGLVIPLGIM 219

Qy 200 CFFYYKMVVFLKRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMRNLRIASRLDSW 259
       Ml:: | : | : : | : : s | | : | : | | | : | : | | :
Db 220 -LFCYSMII----RTLQHCKSKKKNKAVKMIFAVVVLFLGFWTPYNIVLFLYTLVELEVL 274

Qy 260 PQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFR 306
       : 1 : 1 III:: : 1 1 1 M : 1 : : 1 : : : | :
Db 275 QDCSLEKYLDFALQATETLAFIHCCLNPIIYFFLGEKFRKYIVQLFK 321


RESULT 11
Q8MJW8
ID Q8MJW8 PRELIMINARY; PRT; 360 AA.
AC Q8MJW8;
DT 01-OCT-2002 (TrEMBLrel. 22, Created)
DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE CC chemokine receptor 4.
GN CCR4.
OS Canis familiaris (Dog).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis.
OX NCBI_TaxID=9615;
RN [1]
RP SEQUENCE FROM N.A.
RA Maeda S., Okayama T., Masuda K., Ohno K., Tsujimoto H.;
RT "Detection of CC chemokine receptor 4 (CCR4) mRNA expression in canine
RT atopic skin lesion.";
RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AB080188; BAC10546.1;
DR GO; GO:0016021; C:integral to membrane; IEA.
DR GO; GO:0004872; F:receptor activity; IEA.
DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.
DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW Receptor.
SQ SEQUENCE 360 AA; 41354 MW; 69115F5209EC0908 CRC64;



Query Match 20.8%; Score 343; DB 6; Length 360; 

Best Local Similarity 29.3%; Pred. No. 1.6e-21; 

Matches 98; Conservative 59; Mismatches 137; Indels 40; Gaps 10; 



Qy 14 EAI LNKYYL SAF — YAIEFI FGLLGNVT WFGYLFCMKNWN 52
MINI M 1:: hllllll :|| III
Db 13 ESIYNNYYLYENIPKPCTKEGIKAFGELFLPPLYSLVFLFGLLGN-SVWWLFKYKRLK 71

Qy 53 S-SNVYLFNLSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTF 111
1 : : 1 1 1 1 1 : II 1 1 : : I I | | | : : | | | : : | : | | :
Db 72 brai jjv YLLNLA1 bDLLFVLSLPFW-GYYAADQWVFGLGLCKII SWMYLVGFYSGI FFIML 130

Qy 112 ISMDRYLLMKYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDY 171
= 1 : 1 1 1 1 : : : : : : | | | | : : | | | | | : : |
Db 131 MSIDRYLAIVHAVFSLRARTLTYGVITSLATWSVAVLASLPGLLFSTCYTERNHTYCKTK 190

Qy 172 ASSGNPEHNLIYSLCLTLLGFLIPLSVMCFFYYKMVVFLKRRSQQQATALPLDKPQRLVV 231
1 : M : :li MM 1 1 1 h: |: | | l:|
Db 191 YSRNSTRWKVLSSLEINILGLVIPLGTM-LFCYSMII----RTLQHCKNEKKSKAVRMVF 245

Qy 232 LAVVIFSILFTPYHIMRNLRIASRLDSWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFY 290
1 ; l : II::: 1 |: 1 II :: : 1 III:: :||: |
Db 246 AW7VLFLGFWAPYNVVLFLETLVELEVL-QDCTFERHLDYAIQATETLAFVHCCLNPVIY 304

Qy 291 FLMGDHYREMLISKFR---------QYFKSLTSF 315
1 : 1 : : 1 : 1 : 1 : 1 1 : 1 :
Db 305 FFLGEKFRKYLVQLFKTCRGPFMLCQYCRLLQMY 338





RESULT 12 
Q9ERK9 



ID Q9ERK9 PRELIMINARY; PRT; 328 AA. 

AC Q9ERK9; 

DT 01-MAR-2001 (TrEMBLrel. 16, Created) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE P2Y6 receptor (Hypothetical protein).
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=129/SvEv;
RX MEDLINE=21160052; PubMed=11259526;
RA Lazarowski E.R., Rochelle L.G., O'Neal W.K., Ribeiro C.M.P.,
RA Grubb B.R., Zhang V., Harden T.K., Boucher R.C.;
RT "Cloning and functional characterization of two murine uridine
RT nucleotide receptors reveal a potential target for correcting ion
RT transport deficiency in cystic fibrosis gallbladder.";
RL J. Pharmacol. Exp. Ther. 297:43-49(2001).
RN [2]
RP SEQUENCE FROM N.A.
RA Strausberg R.;
RL Submitted (APR-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AF298899; AAG24619.1;
DR EMBL; BC027331; AAH27331.1;
DR GO; GO:0016021; C:integral to membrane; IEA.
DR GO; GO:0004872; F:receptor activity; IEA.
DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.
DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW Hypothetical protein; Receptor.
SQ SEQUENCE 328 AA; 36721 MW; 00F9DF5ADADF903E CRC64;

Query Match 20.5%; Score 338; DB 11; Length 328; 

Best Local Similarity 29.6%; Pred. No. 4e-21; 

Matches 84; Conservative 54; Mismatches 136; Indels 10; Gaps 4; 

Qy 22 LSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSYA- 80
I : i : : : I I 1:1: : III I I : : : I : I : I I : I I : | |
Db 29 LTPVYSWLWGLPLNICVIAQICASRRTLTRSAVYTLNLALADLMYACSLPLLIYNYAR 88

Qy 81 NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFRE-HFLQKKEFAILIS 139
I : M : I f : : : I | : | | | | i , | | | I | : : | | ; | - .
Db 89 GDHWPFGDLACRFV^FLFYANLHGSILFLTCISFQRYLGICHPIASWHKRGGRRAAWWC 148

Qy 140 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199
II M:|| I : : : I I : I I : I I : : | | | : | :
Db 149 GWWLAVTAQCLPTAVFAATGIQRNRTVCYDLSPPILSTRYLPYGMALTVIGFLLPFIAL 208

Qy 200 CFFYYKMVVFLKRRSQQQATALPL-----DKPQRLVVLAVVIFSILFTPYHIMRNLRIAS 254
I : I : I : I I I : I I : I : : I : I I I : I I : : |
Db 209 LACYCRMA RRL C RQ D G PAG P VAQE RRS KAARMAVWAAVFAI S FL P FH I T KT AYLAV 265

Qy 255 RLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYR 298
I : : I I I I I : I I : : I I : : : |
Db 266 RSTPGVSCPVLETFAAAYKGTRPFASVNSVLDPILFYFTQQKFR 309



RESULT 13 
Q9MYJ8 

ID Q9MYJ8 PRELIMINARY; PRT; 355 AA. 

AC Q9MYJ8; 

DT 01-OCT-2000 (TrEMBLrel. 15, Created) 

DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Chemokine receptor.
GN CCR1.
OS Callithrix jacchus (Common marmoset).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Platyrrhini; Callitrichidae; Callithrix.
OX NCBI_TaxID=9483;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=20153429; PubMed=10686294;
RA Liang M., Rosser M.P., Ng H.P., May K., Bauman J.G., Islam I.,
RA Ghannam A., Kretschmer P.J., Pu H., Dunning L., Snider R.M.,
RA Morrissey M.M., Hesselgesser J., Perez H.D., Horuk R.;
RT "Species selectivity of a small molecule antagonist for the CCR1
RT chemokine receptor.";
RL Eur. J. Pharmacol. 389:41-49(2000).
DR EMBL; AF127528; AAF36453.1;
DR GO; GO:0016021; C:integral to membrane; IEA.
DR GO; GO:0004872; F:receptor activity; IEA.
DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.
DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW Receptor.
SQ SEQUENCE 355 AA; 40928 MW; 2B01C47E9874A2C1 CRC64;



Query Match 20.3%; Score 335; DB 6; Length 355; 

Best Local Similarity 28.1%; Pred. No. 7.8e-21; 



Matches 85; Conservative 63; Mismatches 138; Indels 16; Gaps 7; 

Qy 12 ATEAILNKYYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCT 71
II I I:: I: l|:||: || : : | S ::|| ||:|M || |
Db 27 ANERAFGAKLLPPLYSLVFVIGLVGNILVVVVLVQYKRLKNMTSIYLLNLAISDLLFLFT 86

Qy 72 LPILIKSYANDKGTYGDVLC--ISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFL 129
N I : I : : I : I I : I I I : I I : : : : | | | | : :
Db 87 LP™ISYQLKTDWVFGNAMCKVLSGFY--YTGLYSEIFFIILLTIDRYLAIVHAVFALRA 144

Qy 130 QKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNC-IDYASSGNPEHNLIYSLCLT 188
: I I: :l I I I I I I : : I : : | | :| |
Db 145 RTVTFGVITSIIIWVLAILASLPGLYFAKTQWEITHRTCSLHFPHESRQEWKLFQALKLN 204

Qy 189 LLGFLIPLSVMCFFYYKMVVFLKRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMR 248
Ml ::|| II I :: III :: : | I I : : : : | | : : | | | : :
Db 205 LLGLVLPLLVMI VCYTGI I KI LLRRPNEKKS KAVRLIFVIMI IFFLFWTPYNLTT 259

Qy 249 NLRIASRLDSWPQGCTQ-KAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQ 307
: : : Ml:: : I : I : : : I I : I : | : : | : | | |
Db 260 LISVFQDF-LFTYGCEQGRQLDLAIQVTEMIAYTHCCVNPVIYAFVGERFRKHL----RQ 314

Qy 308 YF 309
I
Db 315 LF 316



RESULT 14 
Q9JLY8 

ID Q9JLY8 PRELIMINARY; PRT; 355 AA.
AC Q9JLY8;
DT 01-OCT-2000 (TrEMBLrel. 15, Created)
DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE Macrophage inflammatory protein-1 alpha receptor.
OS Rattus norvegicus (Rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.
OX NCBI_TaxID=10116;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=CD;
RX MEDLINE=20555330; PubMed=11091494;
RA Waller A., Nayee P., Czaplewski L.G.;
RT "Identification and characterization of a rat macrophage inflammatory
RT protein-1 alpha receptor.";
RL J. Hematother. Stem Cell Res. 9:703-710(2000).
DR EMBL; AF119381; AAF34340.1;
DR GO; GO:0016021; C:integral to membrane; IEA.
DR GO; GO:0004872; F:receptor activity; IEA.
DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.
DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW Receptor.
SQ SEQUENCE 355 AA; 40838 MW; 2FEB8661D1E6E075 CRC64;

Query Match 20.2%; Score 333; DB 11; Length 355; 

Best Local Similarity 28.6%; Pred. No. 1.2e-20; 

Matches 85; Conservative 69; Mismatches 119; Indels 24; Gaps 9; 

Qy 22 LSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSYAN 81
I I : II I : : I I : I : : : : : : : || I I I : : I I II Ml I
Db 37 LPPLYSFVFIIGWGNILVI LVLMQHRRLQSMTSI YLFNLAVSDLVFLFTLPFWIDYKLK 96

Qy 82 DKGTYGDVLC--ISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILIS 139
I :M :| :| I I I I : I I : : : : I || | : : : | | : |
Db 97 DNWVFGDAMCKLLSGFYYL--GLYSEIFFIILLTIDRYLAIVHAVFSLRARTVTFGIITS 154

Qy 140 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNL-----IYSLCLTLLGFLI 194
: sill I M I I : : I I |: :| :| I Ml ::
Db 155 IIIWALAILASIPALCFFKAQWEFTHHTC----SPHFPDESLKTWKRFQALKLNLLGLIL 210

Qy 195 PLSVMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMRNLRIA 253
I I I I I : : : | : | : : : : | | : II : : : I : I : | | | : : : : :
Db 211 PLLVMIICYAGIIRILLRRPNEKKAKAV------RLIFAITLLFFLLWTPYNL--TVFVS 262

Qy 254 SRLD-SWPQGCTQ-KAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQY 308
: I : Ml: : I : I : : Mill M : M : I I : : :
Db 263 AFQDVLFTNQCEQSKQLDLAIQVTEVIAYTHCCVNPIIYVFVGERFRKYLRQLFQRH 319



RESULT 15 
Q86VA9 

ID Q86VA9 PRELIMINARY; PRT; 374 AA. 

AC Q86VA9; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 



DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE CCR1 protein (Fragment).
GN CCR1.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Blood;
RX MEDLINE=22388257; PubMed=12477932;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S.,
RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E.,
RA Jones S.J., Marra M.A.;
RT "Generation and initial analysis of more than 15,000 full-length human
RT and mouse cDNA sequences.";
RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).
RN [2]
RP SEQUENCE FROM N.A.
RC TISSUE=Blood;
RA Strausberg R.;
RL Submitted (APR-2003) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC051306; AAH51306.1; -.
DR GO; GO:0016021; C:integral to membrane; IEA.
DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.
DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
FT NON_TER 1 1
SQ SEQUENCE 374 AA; 43400 MW; 5A3727508DD893E5 CRC64;

Query Match 19.9%; Score 328; DB 4; Length 374; 

Best Local Similarity 29.0%; Pred. No. 3.3e-20; 

Matches 85; Conservative 63; Mismatches 127; Indels 18; Gaps 8; 

Qy 22 LSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSYAN 81
I I- I* M:ll: II : : I :::|| Ihlll , I; |
Db 56 LPPLYSLVFVIGLVGNILWLVLVQYKRLKNMTSIYLLNLAISDLLFLFTLPFWIDYKLK 115

Qy 82 DKGTYGDVLC--ISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILIS 139
' ■ i i - i • i i • i i i • i i . •••mm : | : : |
Db 116 DDWVFGDAMCKILSGFY--YTGLYSEIFFIILLTIDRYLAIVHAVFALRARTVTFGVITS 173

Qy 140 LAVWALVTLEVLPMLTFINSVPKEEGSNC-IDYASSGNPEHNLIYSLCLTLLGFLIPLSV 198
= :MI I :| I I : : I : : I I :| I | | ::|| |
Db 174 IIIWAIAILASMPGLYFSKTQWEFTHHTCSLHFPHESLREWKLFQALKLNLFGLVLPLLV 233

Qy 199 MCFFYYKMVVFLKRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMRNLRIASRLD- 257
I I : = Ml : : : I I I : : : : I I : : I I I : : : | : |
Db 234 MIICYTGIIKILLRRPNEKKS-----KAVRLIFVTMIIFFLFWTPYNL--TILISVFQDF 286

Qy 258 SWPQGCTQ-KAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYF 309
: II:: : I : I : : : II : I : I : : I : | | | |
Db 287 LFTHECEQSRHLDLAVQVTEVIAYTHCCVNPVIYAFVGERFRKYL----RQLF 335



Search completed: August 23, 2004, 17:02:52 
Job time : 119 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: August 23, 2004, 16:47:58 ; Search time 25 Seconds 

(without alignments) 
660.250 Million cell updates/sec 

Title: US-09-891-138A-2 
Perfect score: 1650 

Sequence: 1 MAQNLSCENWLATEAILNKY...REMLISKFRQYFKSLTSFRT 317

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 
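The parameters above (Smith-Waterman model, substitution matrix, gap open and gap extension penalties) describe a local alignment with affine gap costs. The sketch below illustrates that kind of scoring using Gotoh's recurrences. It is not GenCore's implementation: `toy_score` is a hypothetical stand-in for BLOSUM62, and the convention that a gap of length k costs `gap_open + k * gap_ext` is an assumption, since the report does not state how the two penalties combine.

```python
def toy_score(x: str, y: str) -> float:
    """Hypothetical stand-in for a substitution matrix such as BLOSUM62."""
    return 5.0 if x == y else -4.0


def sw_affine(a: str, b: str, score=toy_score,
              gap_open: float = 10.0, gap_ext: float = 0.5) -> float:
    """Best local alignment score with affine gaps (Gotoh's recurrences).

    Assumed gap convention: a gap of length k costs gap_open + k * gap_ext.
    """
    neg = float("-inf")
    n, m = len(a), len(b)
    # H: best score ending at (i, j); E/F: best score ending in a gap
    # that skips a residue of b (E) or of a (F).
    H = [[0.0] * (m + 1) for _ in range(n + 1)]
    E = [[neg] * (m + 1) for _ in range(n + 1)]
    F = [[neg] * (m + 1) for _ in range(n + 1)]
    best = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            E[i][j] = max(E[i][j - 1] - gap_ext,
                          H[i][j - 1] - gap_open - gap_ext)
            F[i][j] = max(F[i - 1][j] - gap_ext,
                          H[i - 1][j] - gap_open - gap_ext)
            diag = H[i - 1][j - 1] + score(a[i - 1], b[j - 1])
            H[i][j] = max(0.0, diag, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


# One deleted residue: 8 identities (40) minus one gap of length 1 (10.5).
print(sw_affine("ACDEFGHIK", "ACDEGHIK"))  # 29.5
```

With a real substitution matrix, `score` would be replaced by a table lookup; the recurrences themselves stay the same.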

Searched: 141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters: 141681 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : SwissProt_42 : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
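Two of the derived figures in this report can be checked from the values printed in the header. The "Query Match" percentage in the summaries is consistent with Score divided by the perfect score, and the throughput figure is consistent with query length times residues searched divided by search time. The sketch below is inferred from the printed numbers rather than taken from GenCore documentation, and the function names are hypothetical.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Score of a hit as a percentage of the query's self-score."""
    return 100.0 * score / perfect_score


def cell_updates_per_second(query_len: int, db_residues: int,
                            seconds: float) -> float:
    """Dynamic-programming cells filled per second of search time."""
    return query_len * db_residues / seconds


# Values from this run: perfect score 1650, top hit score 481,
# query length 317, 52070155 residues searched in 25 seconds.
print(f"{query_match_percent(481, 1650):.1f}%")
print(f"{cell_updates_per_second(317, 52070155, 25) / 1e6:.3f} Million cell updates/sec")
```

These reproduce the 29.2% listed for the first summary entry and the 660.250 Million cell updates/sec printed above.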



SUMMARIES 

                  %
Result          Query
 No.   Score    Match  Length  DB  ID           Description

   1   481      29.2     373    1  P2YR_CAVPO   P59902 cavia porce
   2   479      29.0     373    1  P2YR_RAT     P49651 rattus norv
   3   475      28.8     373    1  P2YR_MOUSE   P49650 mus musculu
   4   474      28.7     373    1  P2YR_BOVIN   P48042 bos taurus
   5   474      28.7     373    1  P2YR_HUMAN   P47900 homo sapien
   6   472.5    28.6     362    1  P2YR_CHICK   P34996 gallus gall
   7   472.5    28.6     362    1  P2YR_MELGA   P49652 meleagris g
   8   426      25.8     337    1  GP80_HUMAN   Q96p68 homo sapien
   9   386.5    23.4     537    1  P2Y8_XENLA   P79928 xenopus lae
  10   379.5    23.0     328    1  P2Y3_CHICK   Q98907 gallus gall
  11   374      22.7     374    1  P2Y2_RAT     P41232 rattus norv
  12   373.5    22.6     373    1  P2Y2_MOUSE   P35383 mus musculu
  13   371.5    22.5     328    1  P2Y3_MELGA   O93361 meleagris g
  14   363.5    22.0     377    1  P2Y2_HUMAN   P41231 homo sapien
  15   354      21.5     361    1  P2Y4_RAT     O35811 rattus norv
  16   353      21.4     355    1  C3X1_HUMAN   P49238 homo sapien
  17   352      21.3     365    1  P2Y4_HUMAN   P51582 homo sapien
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. 8 
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1 
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mus musculu 


19 
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rattus norv 


21 
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ft 


1 

X 


ir^lj LnlLn. 


P32250 


gallus gall 


22 
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20 , 
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J J *i 


1 

X 


(~* 1 DST 1 
tJAl KA.I 


P35411 


rattus norv 


23 
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20 , 


, 1 




1 

X 


P9VS ITT T"N/ a "NT 


Q15077 


homo sapien 


24 


329 


19 , 


, 9 


JJJ 


1 

X 




P56482 


macaca mula 


25 


328 


19 . 


. 9 


jjj 


1 

X 




P32246 


homo sapien 


26 


326. 5 


19 . 


R 

■ o 


j fi O 


1 

X 


f<T tjt TTv/f 7\ "NT 


Q9ns75 


homo sapien 


27 


326 . 5 


19 . 


, 8 




1 

X 


^n-K^i rlUUjrj 


P51680 


mus musculu 


28 


325 . 5 


19 . 


7 


j J ft 


X 


f"Ovl >X/^iT TCU 

Loa± MUUbb 


Q9z0d9 


mus musculu 


29 


325 . 5 


19 . 


7 


^ ft ft 
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X 


Q O ^ "LJT TV/TTWr 

ooK4 HUMAN 


P31391 


homo sapien 


30 


321 


19 . 


j 


j 4 *± 


1 
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Q8bmc0 


mus musculu 


31 
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19 . 


A 
*± 




1 
X 




Q95n03 


sus scrofa 


32 
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19 . 


j 


^ ^"3 
jjj 


1 
X 




P79960 


xenopus lae 


33 


318 . 5 


19 . 


3 




1 

X 


TT QD MOT T <^ 1? 

x xj o Jd 1Y1U U o 1L 


P35343 


mus musculu 


34 
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19 . 


2 


JJJ 


1 
X 


brKl nUJyiAJN 


P46091 


homo sapien 


35 


317 


19 . 


? 


^£ft 

J D O 


1 

X 




P34993 


bos taurus 


36 


316 . 5 


x _? ■ 


O 
c. 


^ ft 4 
j 0 4 


1 

X 


C C D /l T"> 7\ T> 

bbK4 KAI 


P30937 


rattus norv 


37 


315 . 5 


x y • 


1 

-L 


"3 ^ 
jjj 


1 
X 


GPR1 MACMU 


097664 


macaca mula 


38 


315 . 5 


19 . 


1 

X 


^ R R 

jjj 


1 

X 


TT Q A T"> 7\ "D ~T rp 

llioA KArJll 


P21109 


oryctolagus 


39 


315 


19 . 


1 


^ s s 

JJJ 


1 

X 


PVpQ T-TT TMSXI 

L!\Ko nUiyLAlN 


P51685 


homo sapien 


40 


314 


19 . 


o 


344 


1 


P?YS T4TTMAM 




homo sapien 


  41   314      19.0     369    1  SSR2_HUMAN   P30874 homo sapien
  42   314      19.0     369    1  SSR2_PIG     P34994 sus scrofa
  43   314      19.0     370    1  P2Y9_HUMAN   Q99677 homo sapien
  44   313.5    19.0     353    1  IL8B_PANTR   Q28807 pan troglod
  45   313.5    19.0     356    1  CKR8_MACMU   O97665 macaca mula



ALIGNMENTS 



RESULT 1 
P2YR_CAVPO

ID P2YR_CAVPO STANDARD; PRT; 373 AA.

AC P59902; 

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE P2Y purinoceptor 1 (ATP receptor) (P2Y1) (Purinergic receptor). 

GN P2RY1. 

OS Cavia porcellus (Guinea pig).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Hystricognathi; Caviidae; Cavia.

OX NCBI_TaxID=10141; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Hartley; TISSUE=Small intestine; 

RA Gao N., Hu H., Zhu M.X., Wood J.D.; 

RT "A novel P2Y1 receptor in the guinea pig submucous plexus."; 

RL Submitted (JUL-2001) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: Receptor for extracellular adenine nucleotides such as

CC ATP and ADP (By similarity).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 



CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC




EMBL; AY048684; AAL05953. 
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rKUbl 1 b ; 


PS00237 ; 


G_PROTEIN_ 


RECEP Fl 1; 1 


UK 


JrKUoIIE; 
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FT 
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FT 


DOMAIN 150 167 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 168 188 4 (POTENTIAL).
FT DOMAIN 189 220 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 221 241 5 (POTENTIAL).
FT DOMAIN 242 258 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 259 279 6 (POTENTIAL).
FT DOMAIN 280 310 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 311 331 7 (POTENTIAL).
FT DOMAIN 332 373 CYTOPLASMIC (POTENTIAL).
FT DISULFID 124 202 BY SIMILARITY.
FT CARBOHYD 11 11 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 27 27 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 113 113 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 197 197 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 373 AA; 42300 MW; 2BD66543FFE3F341 CRC64;


Query Match 29.2%; Score 481; DB 1; Length 373;

Best Local Similarity 36.9%; Pred. No. 1.1e-24;

Matches 106; Conservative 58; Mismatches 115; Indels 8; Gaps 6;



Qy 20 YYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79
HI 1 1 : II 1 III :: ::| M |: : II : 1 1 I : : : | | : : | | | | | |
Db 52 YYLPAVYIVVFIIGFLGNSIAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111

Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138
1 : 1 1 : 1 1:: 1 HI II! | | : | | : II : I : |
Db 112 FNKTNWIFGDAMCKLQRFIFHVNLYGSILFLTCISAHRYSGWYPLKSLGRLKKKNAVYI 171

Qy 139 SLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197
1 : 1 1 : 1 : : 1 : 1 : : : 1 : II I | | | : | | : | : | |
Db 172 SVLVWLIVWAISPILFYSGTGIRKNKTITCYDTTSDEYLRSYFIYSMCTTVAMFCVPLV 231

Qy 198 VMCFFYYKMVVFLKRRSQQQATALPL-DKPQRLVVLAVVIFSILFTPYHIMRNLRIASRL 256
: : 1 1 1 : : II 1 1 1 : : : : I : : : | : | : | : : : : | |
Db 232 LILGCYGLIVRALIYKDLDNS---PLRRKSIYLVIIVLTVFAVSYIPFHVMKTMNLRARL 288

Qy 257 D-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 301
1 1: II : : 1 :M 1 1 1 1 1 ::| 1 I I I M :| |
Db 289 DFQTPEMCTFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 335
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RESULT 2 
P2YR_RAT 

ID P2YR_RAT STANDARD; PRT; 373 AA. 

AC P49651; 

DT 01-FEB-1996 (Rel. 33, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 01-FEB-1996 (Rel. 33, Last annotation update) 

DE P2Y purinoceptor 1 (ATP receptor) (P2Y1) (Purinergic receptor). 

GN P2RY1. 

OS Rattus norvegicus (Rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.
OX NCBI_TaxID=10116;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Insulinoma;
RX MEDLINE=95298025; PubMed=7779087;
RA Tokuyama Y., Hara M., Jones E.M.C., Fan Z., Bell G.I.;
RT "Cloning of rat and mouse P2Y purinoceptors.";
RL Biochem. Biophys. Res. Commun. 211:211-218(1995).

CC -!- FUNCTION: Receptor for extracellular adenine nucleotides such as 

CC ATP and ADP. In pancreatic islets, may mediate some of the effects 

CC of extracellular ATP on insulin secretion. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: Expressed in muscle, heart, liver, kidney, 

CC lung, brain, spleen, but not in testis. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; U22830; AAA91303.1; -.
DR HSSP; P34996; 1DDD.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotein.
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FT 
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FT 
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FT 
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EXTRACELLULAR (POTENTIAL) . 


FT 
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5 (POTENTIAL) . 


FT 


DOMAIN 
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CYTOPLASMIC (POTENTIAL) . 



FT 




ibo 


285 


FT 




zoo 


303 


FT 
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11 
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FT 
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SQ 


SEQUENCE 


373 AA; 


4232 


Query Match 




29. 



6 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL 

7 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 
BY SIMILARITY. 
N-LINKED (GLCNAC. 
N-LINKED (GLCNAC. 
N-LINKED (GLCNAC. 
N-LINKED (GLCNAC. 



) 



) 



6DDF676287B5E648 CRC64; 



(POTENTIAL) 
(POTENTIAL) 
(POTENTIAL) 
(POTENTIAL) 



Best Local Similarity 37.3%; 



Score 479; DB 1; 
Pred. No. 1.5e-24; 



Length 373; 



Matches 107; Conservative 57; Mismatches 115; Indels 



Gaps 
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RESULT 3 
P2YR_MOUSE 

ID P2YR_MOUSE STANDARD; PRT; 373 AA 

AC P49650; 

DT 01-FEB-1996 (Rel. 33, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE P2Y purinoceptor 1 (ATP receptor) (P2Y1) (Purinergic receptor).
GN P2RY1.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Insulinoma;
RX MEDLINE=95298025; PubMed=7779087;
RA Tokuyama Y., Hara M., Jones E.M.C., Fan Z., Bell G.I.;
RT "Cloning of rat and mouse P2Y purinoceptors.";
RL Biochem. Biophys. Res. Commun. 211:211-218(1995).
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=129/Sv;
RA Leon C.;
RT "Thromboresistance in P2Y1 receptor knockout mice.";
RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Receptor for extracellular adenine nucleotides such as
CC ATP and ADP. In pancreatic islets, may mediate some of the effects
CC of extracellular ATP on insulin secretion.
CC -!- SUBCELLULAR LOCATION: Integral membrane protein.

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 



CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; U22829; AAA91302.1;
DR EMBL; AJ245636; CAB57317.1; -.
DR HSSP; P34996; 1DDD.
DR MGD; MGI:105049; P2ry1.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotein.
FT DOMAIN 1 52 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 53 74 1 (POTENTIAL).
FT DOMAIN 75 87 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 88 109 2 (POTENTIAL).
FT DOMAIN 110 126 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 127 147 3 (POTENTIAL).
FT DOMAIN 148 166 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 167 188 4 (POTENTIAL).
FT DOMAIN 189 218 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 219 238 5 (POTENTIAL).
FT DOMAIN 239 265 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 266 285 6 (POTENTIAL).
FT DOMAIN 286 303 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 304 328 7 (POTENTIAL).
FT DOMAIN 329 373 CYTOPLASMIC (POTENTIAL).
FT DISULFID 124 202 BY SIMILARITY.
FT CARBOHYD 11 11 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 27 27 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 113 113 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 197 197 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 373 AA; 42212 MW; 944125E9F4560BB3 CRC64;

Query Match 28.8%; Score 475; DB 1; Length 373;
Best Local Similarity 37.3%; Pred. No. 2.7e-24;
Matches 107; Conservative 55; Mismatches 117; Indels 8; Gaps 7;

Qy 20 YYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79
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1:11 
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D FQT P EMC D FNDRVYAT YQVT RGLAS LN S CVD P I L YFLAGDT FRRRL 335 

RESULT 4 

P2YR BOVIN ^ 
ID ~P2YR_BOVIN STANDARD ; PRT; 373 AA. 

AC P48042; 

DT 01-FEB-1996 (Rel. 33, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE P2Y purinoceptor 1 (ATP receptor) (P2Y1) (Purinergic receptor). 

GN P2RY1. 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomx ; 

OC MaSia; Eutheria; Cetartiodactyla ; Ruminantia; Pecora; Bovoxdea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxI D=9 9 1 3 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Aortic endothelium; 

RX MEDLINE=95352058; PubMed=7626079 ; 

S Henderson D.J., Elliot D.G., Smith G.M., Webb T . E . , Dainty I.A., 

RT "Cloning and characterisation of a bovine P2Y receptor. ; 

RL Biochem. Biophys . Res. Commun. 212:648-656(1995). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Corpus callosum; 

RX MEDLINE=99064562; PubMed=9848096;

RA Deng G., Matute C., Kumar C.K., Fogarty D.J., Miledi R.;

RT "Cloning and expression of a P2y purinoceptor from the adult bovine

RT corpus callosum.";

RL Neurobiol. Dis. 5:259-270(1998).

CC -!- FUNCTION: Receptor for extracellular adenine nucleotides such as
CC ATP and ADP.

CC -!- SUBCELLULAR LOCATION: Integral membrane protein.

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC


DR EMBL; X87628; CAA60958.1; -.
DR EMBL; U34041; AAC78275.1; -.
DR PIR; JC4162; JC4162.
DR HSSP; P34996; 1DDD.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotein.
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5 (POTENTIAL) . 
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CYTOPLASMIC (POTENTIAL) . 
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6 (POTENTIAL) . 


FT DOMAIN 286 303
FT TRANSMEM 304 328 7 (POTENTIAL).
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DOMAIN 
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CYTOPLASMIC (POTENTIAL) . 


FT 


DISULFID 
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N-LINKED (GLCNAC. . .) (POTENTIAL). 
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N-LINKED (GLCNAC. . . ) (POTENTIAL). 
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N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT CARBOHYD 197 197 N-LINKED (GLCNAC...) (POTENTIAL).


SQ SEQUENCE 373 AA; 42287 MW; 9270A7175C0BDA76 CRC64;

Query Match 28.7%; Score 474; DB 1; Length 373;



Best Local Similarity 36.6%; Pred. No. 3.1e-24; 
Matches 105; Conservative 58; Mismatches 116; Indels 8; Gaps 6;

Qy 20 YYLSAFYAIEFIFGLLGNWVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79

I N I I : I I I I I | : : : : | M I : : I I : II I : : : I I : : I I I | I | 
Db 52 YYLPAVYILVFIIGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111

Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138

I : M : I I :: I I I I I I I I I I I | | | : | | : | | : | : | 

Db 112 FNKTDWIFGDAMCKLQRFIFHVNLYGSILFLTCISAHRYSGWYPLKSLGRLKKKNAVYI 171

Qy 139 SLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

I : I I : I : : I : I : : : | : | | | I I I : I I : I : I I 

Db 172 SVLVW LIVWGISPILFYSGTGIRKNKTITCYDTTSDEYLRSYFIYSMCTTVAMFCVPLV 231 

Qy 198 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLVVLAVVIFSILFTPYHIMRNLRIASRL 256

I ; l I = : M I II:: : :|:: : |:|:|: : : :|| 
Db 232 LILGCYGLIVRALIYKDLDNSPLRRKSIYLVIIVLTVFAVSYIPFHVMKTMNLRARL 288

Qy 257 D-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 301



I I : I : • I : I I I I I I I : : I I I I I I I : I I 

Db 289 DFQTPEMCAFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 335



RESULT 5 
P2YR_HUMAN

ID P2YR_HUMAN STANDARD; PRT; 373 AA.

AC P47900;

DT 01-FEB-1996 (Rel. 33, Created)

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE P2Y purinoceptor 1 (ATP receptor) (P2Y1) (Purinergic receptor). 

GN P2RY1.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Placenta; 

RX MEDLINE=96257237; PubMed=8666290;

RA Leon C., Vial C., Cazenave J.-P., Gachet C.;

RT "Cloning and sequencing of a human cDNA encoding endothelial P2Y1

RT purinoceptor.";

RL Gene 171:295-297(1996). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96158962; PubMed=8579591;

RA Ayyanathan K., Tania W., Harbansjit S., Raghbir A.S., Barnard E.A.,

RA Kunapuli S.P.;

RT "Cloning and chromosomal localization of the human P2Y1

RT purinoceptor.";

RL Biochem. Biophys. Res. Commun. 218:783-788(1996).

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96205320; PubMed=8630005;

RA Janssens R., Communi D., Pirotton S., Samson M., Parmentier M.,

RA Boeynaems J.M.;

RT "Cloning and tissue distribution of the human P2Y1 receptor."; 

RL Biochem. Biophys. Res. Commun. 221:588-593(1996). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Blood; 

RA Leon C., Vial C., Weber J., Cazenave J.-P., Gachet C.;

RL Submitted (JUN-1998) to the EMBL/GenBank/DDBJ databases.

RN [5] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lung; 

RA Puhl H.L. III, Ikeda S.R., Aronstam R.S.;

RT "cDNA clones of human proteins involved in signal transduction 

RT sequenced by the Guthrie cDNA resource center (www.cdna.org)."; 

RL Submitted (JUL-2002) to the EMBL/GenBank/DDBJ databases.

RN [6]

RP SEQUENCE OF 95-373 FROM N.A., FUNCTION, AND INHIBITION STUDIES.

RC TISSUE=Platelet;

RX MEDLINE=98113162; PubMed=9442040;

RA Jin J., Daniel J.L., Kunapuli S.P.; 



RT "Molecular basis for ADP-induced platelet activation. II. The P2Y1 

RT receptor mediates ADP-induced intracellular calcium mobilization and 

RT shape change in platelets."; 

RL J. Biol. Chem. 273:2030-2034(1998).

CC -!- FUNCTION: Receptor for extracellular adenine nucleotides such as

CC ATP and ADP. In platelets binding to ADP leads to mobilization of

CC intracellular calcium ions via activation of phospholipase C, a

CC change in platelet shape, and probably to platelet aggregation.

CC -!- SUBCELLULAR LOCATION: Integral membrane protein.

CC -!- INDUCTION: Repressed by the P2Y1 receptor-specific antagonists
CC A3P5PS, A3P5P and A2P5P. These inhibit calcium ion mobilization

CC and shape change in platelets.

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.






CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; Z49205; CAA89066.1; -. 

DR EMBL; U42030; AAA97873.1; -.

DR EMBL; U42029; AAA97872.1; -. 

DR EMBL; S81950; AAB47091.1; -. 

DR EMBL; AJ006945; CAA07339.1; -. 

DR EMBL; AY136752; AAN01278.1; -. 

DR EMBL; AF018284; AAB94556.1; -. 

DR PIR; JC4737; JC4737. 

DR HSSP; P34996; 1DDD. 

DR Genew; HGNC:8539; P2RY1. 

DR MIM; 601167; -. 

DR GO; GO:0005887; C:integral to plasma membrane; TAS.
DR GO; GO:0004872; F:receptor activity; TAS.

DR GO; GO:0007166; P:cell surface receptor linked signal transdu...; TAS.
DR GO; GO:0007200; P:G-protein signaling, coupled to IP3 second...; TAS.
DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein; Platelet; 

KW Blood coagulation. 

FT DOMAIN 

FT TRANSMEM 

FT DOMAIN 

FT TRANSMEM 

FT DOMAIN 

FT TRANSMEM 

FT DOMAIN 

FT TRANSMEM 

FT DOMAIN 

FT TRANSMEM 

FT DOMAIN 

FT TRANSMEM 
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5 (POTENTIAL). 
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FT 


CONFLICT 


138 


138 


SQ 


SEQUENCE 


373 AA; 


421 


Query Match 




21 



42071 MW; 



7 (POTENTIAL) . 
CYTOPLASMIC ( POTENTIAL) . 
BY SIMILARITY. 

N-LINKED (GLCNAC . . .) (POTENTIAL) 
N-LINKED (GLCNAC. . .) (POTENTIAL) 
N-LINKED (GLCNAC. . . ) (POTENTIAL) 
N-LINKED (GLCNAC. . .) (POTENTIAL) 
MISSING (IN REF. 1) . 

4DC7C668B4145392 CRC64; 



Best Local Similarity 37. 3^ 
Matches 107; Conservative 



; Pred. No. 3.1e-24; 
55; Mismatches 117; Indels 



Gaps 



Qy 


20 


Db 


52 


Qy 


80 


Db 


112 


Qy 


139 


Db 


172 


Qy 


198 


Db 


232 


Qy 


257 


Db 


289 



1 1 



1 1 



I I I I I : : : : I I I I : : || : | | | 



I I I I I I I I I I I I II 



:| : : I I : III 



I I I 



I : II 



111:11: I : I I 



6; 



M I II:: : :|:: : |:|:|: : : :|| 
- PLRRKS I YLVI IVLTVFAVS YI PFHVMKTMNLRARL 288 



II II III 



II I I I I I : I | 



RESULT 6 
P2YR_CHICK 

ID P2YR_CHICK STANDARD; PRT; 362 AA. 

AC P34996; 

DT 01-FEB-1994 (Rel. 28, Created) 

DT 01-FEB-1994 (Rel. 28, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE P2Y purinoceptor 1 (ATP receptor) (P2Y1) (Purinergic receptor).

GN P2RY1.

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;

OC Gallus.

OX NCBI_TaxID=9031;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=93285340; PubMed=8508924;

RA Webb T.E., Simon J., Krishek B.J., Bateson A.N., Smart T.G., 

RA King B.F., Burnstock G., Barnard E.A.; 

RT "Cloning and functional expression of a brain G-protein-coupled ATP 

RT receptor."; 



RL FEBS Lett. 324:219-225(1993).
RN [2]
RP 3D-STRUCTURE MODELING.
RX MEDLINE=97026278; PubMed=8872457;
RA van Rhee A.M., Fischer B., van Galen P.J.M., Jacobson K.A.;
RT "Modelling the P2Y purinoceptor using rhodopsin as template.";
RL Drug Des. Discov. 13:133-140(1995).
CC -!- FUNCTION: Receptor for extracellular adenine nucleotides such as
CC ATP and ADP. Seems to mediate its action via a pertussis toxin
CC insensitive G-protein, probably belonging to the Gq family that
CC activate a phosphatidylinositol-calcium second messenger system.
CC -!- SUBCELLULAR LOCATION: Integral membrane protein.
CC -!- TISSUE SPECIFICITY: Brain, spinal cord, gastrointestinal tract,
CC spleen and leg muscle. Is not detected in the heart, liver,
CC stomach, lung and kidney.
CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; X73268; CAA51716.1; -.
DR PIR; S33733; S33733.
DR PDB; 1DDD; 11-JUL-96.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotein; 3D-structure.
noMATM 1 /ii EXTRACELLULAR (POTENTIAL). 

1 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

2 (POTENTIAL) . 
EXTRACELLULAR ( POTENTIAL) . 

3 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

4 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

5 (POTENTIAL). 
CYTOPLASMIC (POTENTIAL) . 

6 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

7 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 
BY SIMILARITY. 

N-LINKED (GLCNAC. . .) (POTENTIAL). 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
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SEQUENCE 


362 AA; 


41194 



Query Match 28.6%; Score 472.5; DB 1; Length 362; 

Best Local Similarity 35.9%; Pred. No. 3.8e-24; 

Matches 110; Conservative 58; Mismatches 121; Indels 17; Gaps 8 



Qy 10 WLA TEAILNK YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60
1 1 1 : II Ml 1 : II 1 III : : : : | | : | : : | | : | |
Db 22 WAAGNATTKCSLTKTGFQFYYLPTWILVFITGFLGNSVAIWMFVFHMRPWSGISVYMFN 81

Qy 61 LSISDFAFLCTLPILIKSYANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLL 119
' ::s H :: HI N M HI hi 1:: 1 III IMIII ||: M
Db 82 LM^ADFLYVLTLPALIFYYFNKTDWIFGDVMCKLQRFI^ 141

Qy 120 MKYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPE 178
: = i : | | : | : : | Ml, : I : | : : I : : | | :
Db 142 WHPLKSLGRLKKKNAVYVSSLVWALWAVIAPILFYSGTGVRRNKTITCYDTTADEYLR 201

Qy 179 HNLIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPL-DKPQRLVVLAVVIF 237
: 1 1 : 1 1 : III I : 1 : | | : : | | | | | s - . : |
Db 202 SYFVYSMCTTVFMFCIPFIVILGCYGLIVKALIYKDLDNS PLRRKSIYLVIIVLTVF 258

Qy 238 SILFTPYHIMRNLRIASRLD-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGD 295
:: : 1 : 1 : 1 : 1 : : 1 : : 1 : 1 1 1 1 1 1 1 :: 1 1 I I I | |
Db 259 AVSYLPFHVMKTLNLRARLDFQTPQMCAFNDKVYATYQVTRGLASLNSCVDPILYFLAGD 318

Qy 296 HYREML 301
: 1 1
Db 319 TFRRRL 324





RESULT 7 
P2YR_MELGA 

ID P2YR_MELGA STANDARD; PRT; 362 AA. 

AC P49652; 

DT 01-FEB-1996 (Rel. 33, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE P2Y purinoceptor 1 (ATP receptor) (P2Y1) (Purinergic receptor) (6H1 

DE orphan receptor) . 

GN P2RY1. 

OS Meleagris gallopavo (Common turkey).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Meleagris.

OX NCBI_TaxID=9103; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=94335907; PubMed=8058061; 

RA Filtz T.M., Li Q., Boyer J.L., Nicholas R.A., Harden T.K.;

RT "Expression of a cloned P2Y purinergic receptor that couples to

RT phospholipase C.";

RL Mol. Pharmacol. 46:8-14(1994).

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97382456; PubMed=9240460;

RA Li Q., Schachter J.B., Harden T.K., Nicholas R.A.;

RT "The 6H1 orphan receptor, claimed to be the P2y5 receptor, does not

RT mediate nucleotide-promoted second messenger responses.";

RL Biochem. Biophys. Res. Commun. 236:455-460(1997).

CC -!- FUNCTION: Receptor for extracellular adenine nucleotides such as 
CC ATP and ADP. Seems to mediate its action via a pertussis toxin 

CC insensitive G-protein, probably belonging to the Gq family that 

CC activate a phosphatidylinositol-calcium second messenger system. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein.

CC -!- TISSUE SPECIFICITY: Mainly found in blood, brain, and lung. To a 
CC lesser extent in stomach, gut and skeletal muscle. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 






CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC

DR EMBL; U09842; AAA18784.1; -.
DR EMBL; AF012103; AAB65428.1; -.
DR HSSP; P34996; 1DDD.

DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein.
™ M7lTXT n " EXTRACELLULAR (POTENTIAL). 

1 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

2 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

3 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

4 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

5 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

6 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

7 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 
BY SIMILARITY. 

N-LINKED (GLCNAC. . .) (POTENTIAL). 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
SQ SEQUENCE 362 AA; 41180 MW; 3E128AB9EB64349C CRC64;



Query Match 28.6%; Score 472.5; DB 1; Length 362; 

Best Local Similarity 35.9%; Pred. No. 3.8e-24; 

Matches 110; Conservative 58; Mismatches 121; Indels 17; Gaps 8 

Qy 10 WLA TEAILNK YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

li 1=11 III I : II I III : : : : I I : I : : | | : | | 

Db 22 WAAGNASTKCSLTKTGFQFYYLPTVYILVFITGFLGNSVAIWMFVFHMRPWSGISVYMFN 81 

Qy 61 LSISDFAFLCTLPILIKSYANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLL 119

l:::M := III II II :|||:| I : : I I II I I I I I I I I : | | 

Db 82 LALADFLYVLTLPALIFYYFNKTDWIFGDVMCKLQRFIFHVNLYGSILFLTCISVHRYTG 141

Qy 120 MKYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPE 178 

: : I : I I : I : : | | | | | | : | : | : : | : : | | : 

Db 142 VVHPLKSLGRLKKKNAVYVSSLVWALWAVIAPILFYSGTGVRRNKTITCYDTTADEYLR 201 

Qy 179 HNL IYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPL-DKPQRLVVLAVVIF 237

:M:| I: I || |: I :| I : : || | ||:: : : | 
Db 202 SYFVYSMCTTVFMFCIPFIVILGCYGLIVKALIYKDLDNS PLRRKSIYLVIIVLTVF 258 

Qy 238 SILFTPYHIMRNLRIASRLD-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGD 295

: 1 = 1:1: I : :IM III : : | : | | | | | | | : : | | || | || 

Db 259 AVSYLPFHVMKTLNLRARLDFQTPQMCAFNDKVTATYQVTRGLASLNSCVDPILYFLAGD 318 

Qy 296 HYREML 301 

: I I 

Db 319 TFRRRL 324 



RESULT 8 
GP80_HUMAN

ID GP80_HUMAN STANDARD; PRT; 337 AA.

AC Q96P68; Q86TL1; 

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Probable G protein-coupled receptor GPR80 (P2Y-like nucleotide 

DE receptor) (P2Y-like GPCR).

GN GPR80 OR GPR99.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21458557; PubMed=11574155;

RA Lee D.K., Nguyen T., Lynch K.R., Cheng R., Vanti W.B., Arkhitko O.,

RA Lewis T., Evans J.F., George S.R., O'Dowd B.F.; 

RT "Discovery and mapping of ten novel G protein-coupled receptor 

RT genes."; 

RL Gene 275:83-91(2001). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Placenta; 

RX PubMed=12098360; 

RA Wittenberger T., Hellebrand S., Munck A., Kreienkamp H.-J.,

RA Schaller H.C., Hampe W.;



RT "GPR99, a new G protein-coupled receptor belonging to a new subgroup 
RT of nucleotide receptors."; 
RL BMC Genomics 3:17-17(2002).
RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22040266; PubMed=12044878;

RA Takeda S., Kadowaki S., Haga T., Takaesu H., Mitaku S.; 

RT "Identification of G protein-coupled receptor genes from the human 

RT genome sequence."; 

RL FEBS Lett. 520:97-101(2002).

RN [4] 

RP SEQUENCE FROM N.A. 

RA Suwa M., Sato T., Okouchi I., Arita M., Futami K., Matsumoto S.,

RA Tsutsumi S., Aburatani H., Asai K., Akiyama Y.;

RT "Genome-wide discovery and analysis of human seven transmembrane helix

RT receptor genes.";

RL Submitted (JUL-2001) to the EMBL/GenBank/DDBJ databases.

RN [5] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Thyroid; 

RA Bruess M., Bonisch H., von Kugelgen I.;

RT "Molecular cloning and functional characterization of a new human P2Y 

RT receptor."; 

RL Submitted (FEB-2003) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: Orphan receptor. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: Detected in kidney and, to a lower extent, in
CC placenta. Not detected in brain tissues including the frontal

CC cortex, caudate putamen, thalamus, hypothalamus, hippocampus or

CC pons.

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF411109; AAL26480.1; -. 

DR EMBL; AF370886; AAM76912.1; -. 

DR EMBL; AB083598; BAB89311.1; -.

DR EMBL; AB065877; BAC06095.1; -. 

DR EMBL; AJ305372; CAC83857.1; -. 

DR Genew; HGNC:4531; GPR80.

DR MIM; 606922; -.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; FALSE_NEG.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein. 

FT DOMAIN 1 34 EXTRACELLULAR (POTENTIAL). 

FT TRANSMEM 35 55 1 (POTENTIAL). 

FT DOMAIN 56 69 CYTOPLASMIC (POTENTIAL). 

FT TRANSMEM 70 90 2 (POTENTIAL). 



FT DOMAIN 91 116 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 117 137 3 (POTENTIAL).
FT DOMAIN 138 151 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 152 172 4 (POTENTIAL).
FT DOMAIN 173 201 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 202 222 5 (POTENTIAL).
FT DOMAIN 223 242 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 243 263 6 (POTENTIAL).
FT DOMAIN 264 284 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 285 305 7 (POTENTIAL).
FT DOMAIN 306 337 CYTOPLASMIC (POTENTIAL).
FT DISULFID 106 183 BY SIMILARITY.
FT CARBOHYD 10 10 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 23 23 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 176 176 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 179 179 N-LINKED (GLCNAC...) (POTENTIAL).
FT CONFLICT 14 14 F -> L (IN REF. 5).
FT CONFLICT 236 236 K -> R (IN REF. 5).
SQ SEQUENCE 337 AA; 38251 MW; 6814EA0044756CE6 CRC64;

Query Match 25.8%; Score 426; DB 1; Length 337;
Best Local Similarity 34.4%; Pred. No. 3.8e-21;
Matches 101; Conservative 57; Mismatches 126; Indels 10; Gaps 6;

Qy 17 LNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILI 76
I ; H I I I: I II I: I : I I : I I I : : I I : : | :| :|| ||
Db 31 LKMHYLPVIYGIIFLVGFPGNAWISTYIFKMRPWKSSTIIMLNLACTDLLYLTSLPFLI 90

Qy 77 KSYAN-DKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFA 135
I I : : = I I : I I : I I I I : II I I I I I : I I : : : | : | |
Db 91 HYYASGENWIFGDFMCKFIRFSFHFNLYSSILFLTCFSIFRYCVTIHPMSCFSIHKTRCA 150

Qy 136 ILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLI--YSLCLTLLGFL 193
:: N : : |:|| I I : I hi II III |:| || |
Db 151 WACAWWIISLVAVIPMTFLITSTNRTNRSACLDLTSS--DELNTIKWYNLILTATTFC 208

Qy 194 IPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMRNLRIA 253
; H :: I :: I I : I : I I : I : : I : I I : | | : | : | |
Db 209 LPLVIVTLCYTTIIHTLTHGLQTDSCL--KQKARRLTILLLLAFYVCFLPFHILRVIRIE 266

Qy 254 SRLDSWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFR 306
INI I : : I I : : I I I I I I : I : | : : | : : : : : | |
Db 267 SRLLS--ISCSIENQIHEAYIVSRPLAALNTFGNLLLYVWSDNFQQAVCSTVR 318



RESULT 9 
P2Y8_XENLA 

ID P2Y8_XENLA STANDARD; PRT; 537 AA. 

AC P79928; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 30-MAY-2000 (Rel. 39, Last annotation update) 

DE P2Y purinoceptor 8 (P2Y8). 

GN P2RY8. 

OS Xenopus laevis (African clawed frog).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae;

OC Xenopodinae; Xenopus.

OX NCBI_TaxID=8355; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Neural plate; 

RX MEDLINE=97284734; PubMed=9139711;

RA Bogdanov Y.D., Dale L., King B.F., Whittock N., Burnstock G.;

RT "Early expression of a novel nucleotide receptor in the neural plate 

RT of Xenopus embryos."; 

RL J. Biol. Chem. 272:12583-12590(1997). 

CC -!- FUNCTION: Receptor for extracellular ATP, UTP, CTP, GTP and ITP. 
CC The activity of this receptor is mediated by G proteins which 

CC activate a phosphatidylinositol-calcium second messenger system. 

CC May play a key role in the early development of neural tissue. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC

DR EMBL; X99953; CAA68213.1; -.
DR HSSP; P34996; 1DDD.

DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein.
FT DOMAIN 1 49 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 50 70 1 (POTENTIAL).
FT DOMAIN 71 79 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 80 100 2 (POTENTIAL).
FT DOMAIN 101 118 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 119 139 3 (POTENTIAL).
FT DOMAIN 140 161 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 162 182 4 (POTENTIAL).
FT DOMAIN 183 210 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 211 231 5 (POTENTIAL).
FT DOMAIN 232 254 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 255 275 6 (POTENTIAL).
FT DOMAIN 276 292 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 293 316 7 (POTENTIAL).
FT DOMAIN 317 537 CYTOPLASMIC (POTENTIAL).
FT DISULFID 116 193 BY SIMILARITY.
FT CARBOHYD 26 26 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 29 29 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 537 AA; 6202 W; B2CF24812F3C19F2 CRC64;



Query Match 23.4%; Score 386.5; DB 1; Length 537;

Best Local Similarity 35.3%; Pred. No. 2.1e-18;
Matches 104; Conservative 51; Mismatches 125; Indels 15; Gaps 7;



Qy 22 LSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSYAN 81 

I I : I : II I : : : = : I : I I : I I : I I I : : I I : : : | | | : | | 

Db 46 LPVSYSAVFMVGLPLNIAAMWIFIAKMRPWNPTTVYMFNIALSDTLYVLSLPT 104 

Qy 82 DKGT--YGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILIS 139

II : I : I I I I : : : II I : I I I I I I I | : | | : : | : | : | 

Db 105 DKNNWPFGEVLCKLVRFLFYANLYSSILFLTCISVHRYRGVCH?ITSLRRMNAKHAYVIC 164 

Qy 140 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199

II Ml ::| I |: II : : | | : : || : | | || :: 

Db 165 ALVWLSVTLCLVPNLIFVTVSPKVKNTICHDTTRPEDFARYVEYSTAIMCLLFGIPCLII 224

Qy 200 CFFYYKMWFLKRR--SQQQATALPLDKPQ--RLVVLAVVIFSILFTPYHIMRNLRIASR 255

II I : I I I M I :::::: hi I |:|| | | :| 
Db 225 AGCYGLMTRELMKPIVSGNQQT-LPSYKKRSIKTIIFVMIAFAICFMPFHITRTLYYYAR 283 

Qy 256 LDSWPQGCTQKAIKSI--YTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQ 307



i * i i - i i i i i iii.iiiii in || | : 
Db 284 L LGIKCYALWINVTYKVTRPLASiWSCIDPILYFLANDRYRRRLIRTVRR 334



RESULT 10 
P2Y3_CHICK 

ID P2Y3_CHICK STANDARD; PRT; 328 AA.

AC Q98907; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE P2Y purinoceptor 3 (P2Y3) (Nucleoside diphosphate receptor).

GN P2RY3.

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;

OC Gallus.

OX NCBI_TaxID=9031;

RN [1] 

RP SEQUENCE FROM N.A.

RC TISSUE=Brain;

RX MEDLINE=96319774; PubMed=8700132;

RA Webb T.E., Henderson D., King B.F., Wang S., Simon J.,

RA Bateson A.N., Burnstock G., Barnard E.A.;

RT "A novel G protein-coupled P2 purinoceptor (P2Y3) activated 

RT preferentially by nucleoside diphosphates."; 

RL Mol. Pharmacol. 50:258-265(1996). 

CC -!- FUNCTION: Receptor for extracellular ADP > UTP > ATP = UDP. The 
CC activity of this receptor is mediated by G proteins which activate 

CC a phosphatidylinositol-calcium second messenger system.

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).



DR EMBL; X98283; CAA66930.1; -.
DR HSSP; P34996; 1DDD.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; FALSE_NEG.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotein.
FT DOMAIN 1 22 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 23 43 1 (POTENTIAL).
FT DOMAIN 44 57 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 58 78 2 (POTENTIAL).
FT DOMAIN 79 96 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 97 117 3 (POTENTIAL).
FT DOMAIN 118 139 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 140 160 4 (POTENTIAL).
FT DOMAIN 161 189 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 190 210 5 (POTENTIAL).
FT DOMAIN 211 231 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 232 252 6 (POTENTIAL).
FT DOMAIN 253 275 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 276 298 7 (POTENTIAL).
FT DOMAIN 299 323 EXTRACELLULAR (POTENTIAL).
FT CARBOHYD 5 5 N-LINKED (GLCNAC...) (POTENTIAL).
FT DISULFID 94 172 BY SIMILARITY.
SQ SEQUENCE 328 AA; 375E J6 MW; 7A3BF1C91F54FAAB CRC64;



Query Match 23.0%; Score 379.5; DB 1; Length 328; 

Best Local Similarity 29.6%; Pred. No. 3.9e-18; 



Matches 88; Conservative 63; Mismatches 135; Indels 11; Gaps 4; 

Qy 22 LSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSYA- 80 

I I :: I : I I I I : I : : | : | | : : : | : : | : | | : | | : | 

Db 24 LPLVYSVVFLLGLPLNAVVIGQIWLARKALTRTTIYMLNLAMADLLYVCSLPLLIYNYTQ 83

Qy 81 NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFRE-HFLQKKEFAILIS 139

I *ll I I: :|||: MINI ||: ||: : :| | : |; | : 

Db 84 KDYWPFGDFTCKFV^FQFYTNLHGSILFLTCISVQRYMGICHPLASWHKKKGKKLTWLVC 143 

Qy 140 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199

'I' : 1 : I I I : : = : I I : I : I I : I I I : I : : 

Db 144 AAVWFIVIAQCLPTFVFASTGTQRNRTVCYDLSPPDRSTSYFPYGITLTITGFLLPFAAI 203 

Qy 200 CFFYYKMVVFLKRRSQQQATAL--PLDKPQRLVVLAVVIFSILFTPYHIMRNLRIASRLD 257

I I I : : : I : II I : : : : | : : | | | I I : | : : : : | 

Db 204 I^CYCSMARILCQKDELIGI^VIIKKKDKAVRMIIIVVIVFSISFFPFHLTKTIYLIVRSS 263 

Qy 258 SWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE-------MLISKFRQ 307



1 * 1 'ii "ii - =11 : I ! : II 

Db 264 ASLPCPTLQAFAIAYKCTRPFASMNSVLDPILFYFTQRKFRESTRYLLDKMSSKWRQ 320
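
The "Best Local Similarity" percentages in these summaries appear to equal the number of matches divided by the total number of aligned columns (Matches + Conservative + Mismatches + Indels). That relation is inferred from the figures printed in this listing, not from a documented definition, and the function name below is hypothetical. A minimal Python sketch of the check:

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of aligned columns that are exact matches.

    Assumption: the denominator is every aligned column, gap columns
    included. This is inferred from the summaries in this report.
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts taken from the summary directly above this alignment.
print(f"{best_local_similarity(88, 63, 135, 11):.1f}%")
```

The same arithmetic applied to the P2Y2_RAT counts (96, 50, 124, 14) gives 33.8%, which agrees with the value reported for that hit.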



RESULT 11 
P2Y2_RAT 

ID P2Y2_RAT STANDARD; PRT; 374 AA. 

AC P41232; 



DT 01-FEB-1995 (Rel. 31, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE P2Y purinoceptor 2 (P2Y2) (P2U purinoceptor 1) (P2U1) (ATP receptor) 

DE (Purinergic receptor).

GN P2RY2 OR P2RU1.

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=95110548; PubMed=7811468;

RA Rice W.R., Burton F.M., Fiedeldey D.T.;

RT "Cloning and expression of the alveolar type II cell P2u-purinergic

RT receptor.";

RL Am. J. Respir. Cell Mol. Biol. 12:27-32(1995).

RN [2]

RP SEQUENCE FROM N.A.

RC STRAIN=Wistar; TISSUE=Pituitary;

RA Chen Z.P., Krull N., Xu S., Levy A., Lightman S.L.;

RL Submitted (AUG-1995) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Wistar; 

RX MEDLINE=98099857; PubMed=9437211;

RA Seye C.I., Gadeau A.P., Daret D., Dupuch F., Alzieu P., Capron L.,

RA Desgranges C.;

RT "Overexpression of P2Y2 purinoceptor in intimal lesions of the rat

RT aorta.";

RL Arterioscler. Thromb. Vasc. Biol. 17:3602-3610(1997).

CC -!- FUNCTION: Receptor for ATP and UTP coupled to G-proteins that 
CC activate a phosphatidylinositol-calcium second messenger system. 

CC The affinity range is UTP = ATP > ATP-gamma-S >> 2-methylthio-ATP

CC = ADP. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; U09402; AAA61565.1; -.
DR EMBL; L46865; AAB02099.1; -.
DR EMBL; U56839; AAC00048.1; -.
DR HSSP; P34996; 1DDD.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein. 

FT DOMAIN 1 32 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 33 59 1 (POTENTIAL).
FT DOMAIN 60 70 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 71 [illegible] 2 (POTENTIAL).
FT DOMAIN [illegible] 110 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 111 129 3 (POTENTIAL).
FT DOMAIN 130 152 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 153 [illegible] 4 (POTENTIAL).
FT DOMAIN [illegible] 194 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 195 220 5 (POTENTIAL).
FT DOMAIN 221 245 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 246 268 6 (POTENTIAL).
FT DOMAIN 269 286 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 287 308 7 (POTENTIAL).
FT DOMAIN 309 374 CYTOPLASMIC (POTENTIAL).
FT CARBOHYD 9 9 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD [illegible] [illegible] N-LINKED (GLCNAC...) (POTENTIAL).
FT DISULFID 106 183 BY SIMILARITY.
FT CONFLICT 132 132 C -> S (IN REF. 1).
FT CONFLICT 143 143 S -> R (IN REF. 1).
FT CONFLICT 166 166 A -> T (IN REF. 1).
FT CONFLICT 188 188 A -> D (IN REF. 1).
SQ SEQUENCE 374 AA; 42137 MW; 2DA6853AA20A1AF3 CRC64;



Query Match 22.7%; Score 374; DB 1; Length 374; 

Best Local Similarity 33.8%; Pred. No. 9.8e-18; 

Matches 96; Conservative 50; Mismatches 124; Indels 14; Gaps 7 

Qy 19 KY-YLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIK 77 

II I I : : I I M : : : I : I I I : I I : I : I : : | | : : | | : | : 
Db 32 KWLLPVSYGWCVLGLCLNWALYIFLCRLKTWNASTTYMFHLAVSDSLYAASLPLLVY 91 



Qy 78 SYA-NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAI 136

M I : I M I ::: I I I I I I I I II I I : | | : I : [ 

Db 92 YYAQGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCISVHRCLGVLRPLHSLSWGHARYAR 151 

Qy 137 LISIAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPL 196 

: - II M |:| |: : : I I :: I : II : I I :| 

Db 152 RVAAVVWVIjVTjACQAPVLYFVTTSVRGTRITCHDTSAK 211 

Qy 197 SVMCFFYYKMWFLKRRSQQQA TALPLDKPQ--RLVAALAWIFSILFTPYHIMRNLR 251 

I- I I : II : I I II I : I : I : :|:: I |:|: | | 

Db 212 SIILVCY VLMARRLLKPAYGTTGLPRAKRKSVRTIAiVIAVFTaCFLPFHVTRTLY 267 

Qy 252 IASRLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMG 294 

: I I I I II I : I I II I I I : : I : I I | | 

Db 268 YSFR--SLDLSCHTLNAINMAYKITRPLASANSCLDPVLYFLAG 309



RESULT 12 
P2Y2_MOUSE 

ID P2Y2_MOUSE STANDARD; PRT; 373 AA.

AC P35383; O09031; Q9CPZ4;

DT 01-JUN-1994 (Rel. 29, Created) 

DT 01-JUN-1994 (Rel. 29, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE P2Y purinoceptor 2 (P2Y2) (P2U purinoceptor 1) (P2U1) (ATP receptor) 
DE (Purinergic receptor).

GN P2RY2 OR P2RU1.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=93281707; PubMed=7685114;

RA Lustig K.D., Shiau A.K., Brake A.J., Julius D.;

RT "Expression cloning of an ATP receptor from mouse neuroblastoma

RT cells.";

RL Proc. Natl. Acad. Sci. U.S.A. 90:5113-5117(1993).

RN [2]

RP SEQUENCE FROM N.A.

RX MEDLINE=96316177; PubMed=8739319;

RA Enomoto K., Furuya K., Moore R.C., Yamagishi S., Oka T., Maeno T.;

RT "Expression cloning and signal transduction pathway of P2U receptor 

RT in mammary tumor cells."; 

RL Biol. Signals 5:9-21(1996). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Head, and Liver; 

RX MEDLINE=21085660; PubMed=11217851; 

RA Kawai J., Shinagawa A., Shibata K., Yoshino M., Itoh M., Ishii Y.,
RA Arakawa T., Hara A., Fukunishi Y., Konno H., Adachi J., Fukuda S.,
RA Aizawa K., Izawa M., Nishi K., Kiyosawa H., Kondo S., Yamanaka I.,
RA Saito T., Okazaki Y., Gojobori T., Bono H., Kasukawa T., Saito R.,
RA Kadota K., Matsuda H.A., Ashburner M., Batalov S., Casavant T.,
RA Fleischmann W., Gaasterland T., Gissi C., King B., Kochiwa H.,
RA Kuehl P., Lewis S., Matsuo Y., Nikaido I., Pesole G., Quackenbush J.,
RA Schriml L.M., Staubli F., Suzuki R., Tomita M., Wagner L., Washio T.,
RA Sakai K., Okido T., Furuno M., Aono H., Baldarelli R., Barsh G.,
RA Blake J., Boffelli D., Bojunga N., Carninci P., de Bonaldo M.F.,
RA Brownstein M.J., Bult C., Fletcher C., Fujita M., Gariboldi M.,
RA Gustincich S., Hill D., Hofmann M., Hume D.A., Kamiya M., Lee N.H.,
RA Lyons P., Marchionni L., Mashima J., Mazzarelli J., Mombaerts P.,
RA Nordone P., Ring B., Ringwald M., Rodriguez I., Sakamoto N.,
RA Sasaki H., Sato K., Schoenbach C., Seya T., Shibata Y., Storch K.-F.,
RA Suzuki H., Toyo-oka K., Wang K.H., Weitz C., Whittaker C., Wilming L.,
RA Wynshaw-Boris A., Yoshida K., Hasegawa Y., Kawaji H., Kohtsuki S.,
RA Hayashizaki Y.;
RT "Functional annotation of a full-length mouse cDNA collection.";
RL Nature 409:685-690(2001).

RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Breast tumor; 

RX MEDLINE=22388257; PubMed=12477932;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [5] 

RP MUTAGENESIS OF LYS-107; ARG-110; HIS-262; ARG-265; LYS-289 AND 

RP ARG-292. 

RX MEDLINE=95181393; PubMed=7876172;

RA Erb L., Garrad R.C., Wang Y., Quinn T., Turner J.T., Weisman G.A.;

RT "Site-directed mutagenesis of P2U purinoceptors. Positively charged

RT amino acids in transmembrane helices 6 and 7 affect agonist potency 

RT and specificity."; 

RL J. Biol. Chem. 270:4185-4188(1995). 

CC -!- FUNCTION: Receptor for ATP and UTP coupled to G-proteins that 
CC activate a phosphatidylinositol-calcium second messenger system.

CC The affinity range is UTP = ATP > ATP-gamma-S >> 2-methylthio-ATP

CC = ADP. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: Spleen, testis, kidney, liver, lung, heart and 
CC brain. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; L14751; AAA39871.1; -.
DR EMBL; S83099; AAB50735.1; -.
DR EMBL; AK005013; BAB23746.1; -.
DR EMBL; AK017378; BAB30719.1; -.
DR EMBL; BC006613; AAH06613.1; -.
DR PIR; A47556; A47556.
DR HSSP; P34996; 1DDD.
DR MGD; MGI:105107; P2ry2.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein. 

FT DOMAIN 1 32 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 33 59 1 (POTENTIAL).
FT DOMAIN 60 70 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 71 93 2 (POTENTIAL).
FT DOMAIN 94 110 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 111 129 3 (POTENTIAL).
FT DOMAIN 130 152 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 153 172 4 (POTENTIAL).
FT DOMAIN 173 194 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 195 220 5 (POTENTIAL).
FT DOMAIN 221 246 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 247 269 6 (POTENTIAL).
FT DOMAIN 270 287 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 288 309 7 (POTENTIAL).
FT DOMAIN 310 373 CYTOPLASMIC (POTENTIAL).
FT CARBOHYD 9 9 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 13 13 N-LINKED (GLCNAC...) (POTENTIAL).
FT DISULFID 106 [illegible] BY SIMILARITY.
FT MUTAGEN 107 107 K->I: NO EFFECT ON RECEPTOR ACTIVATION.
FT MUTAGEN 110 110 R->L: NO EFFECT ON RECEPTOR ACTIVATION.
FT MUTAGEN 262 262 H->L: DECREASE IN RECEPTOR ACTIVATION.
FT MUTAGEN 265 265 R->L: DECREASE IN RECEPTOR ACTIVATION.
FT MUTAGEN 289 289 K->I: NO EFFECT ON RECEPTOR ACTIVATION.
FT MUTAGEN 292 292 R->L: DECREASE IN RECEPTOR ACTIVATION.
FT CONFLICT 17 17 E -> D (IN REF. 2).
FT CONFLICT 120 120 S -> R (IN REF. 2).
FT CONFLICT 125 125 [illegible] -> N (IN REF. 2).
FT CONFLICT 196 196 V -> M (IN REF. 2).
FT CONFLICT 263 263 V -> L (IN REF. 3).
FT CONFLICT 355 355 D -> N (IN REF. 2).
FT CONFLICT 369 371 KDI -> PYV (IN REF. 2).
SQ SEQUENCE 373 AA; 42174 MW; 590BBE502E41B3AE CRC64;



Query Match 22.6%; Score 373.5; DB 1; Length 373;

Best Local Similarity 33.5%; Pred. No. 1.1e-17;

Matches 94; Conservative 50; Mismatches 130; Indels 7; Gaps 5;

Qy 19 KY-YLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIK 77

II 1 1 : : 1 1 M : : : 1 : I | | : | | : | : | : : || : : | | : | :

Db 32 KYVLLPVSYGWCVLGLCLNWALYIFLCRLKTWNASTTYMFHLAVSDSLYAASLPLLVY 91

Qy 78 SYA-NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAI 136

1 1 : III 1 :: : 1 M 1 1 1 1 1 1 1 1 1 : I I : 1 : : |

Db 92 YYARGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCISVHRCLGVLRPLHSLRWGRARYAR 151

Qy 137 LISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPL 196

|:| I: : : | | :: 1 : 1 1 : 1 1 :|

Db 152 RVAAVVWVLVTACQAPVLYFVTTSVRGTRITCHDTSARELFSHFVAYSSVMLGLLFAVPF 211

Qy 197 SVMCFFYYKMVVFLKRRSQQQATALPLDKPQ--RLVVLAVVIFSILFTPYHIMRNLRIAS 254

M ; 1 1 1 : : II 1 : 1 : 1 : : 1 : : I | : | : | | :

Db 212 SVILVCYVlJytARRLLKPAYGTTGGLPRAKRKSWTIALVTAVFALCFLPFHW 271

Qy 255 RLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMG 294

1 1 M M 1 : 1 1 I I I | | : : | : | | | |

Db 272 R--SLDLSCHTLNAINMAYKITRPLASANSCLDPVLYFLAG 310


RESULT 13 
P2Y3_MELGA 

ID P2Y3_MELGA STANDARD; PRT; 328 AA. 

AC O93361;

DT 16-OCT-2001 (Rel. 40, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update) 



DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE P2Y purinoceptor 3 (P2Y3) (Nucleoside diphosphate receptor). 
GN P2RY3. 

OS Meleagris gallopavo (Common turkey).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Meleagris.

OX NCBI_TaxID=9103; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98401046; PubMed=9730913;

RA Li Q., Olesky M., Palmer R.K., Harden T.K., Nicholas R.A.;

RT "Evidence that the p2y3 receptor is the avian homologue of the 

RT mammalian P2Y6 receptor."; 

RL Mol. Pharmacol. 54:541-546(1998). 

CC -!- FUNCTION: RECEPTOR FOR EXTRACELLULAR UDP > ADP = UTP. THE ACTIVITY 
CC OF THIS RECEPTOR IS MEDIATED BY G PROTEINS WHICH ACTIVATE A 

CC PHOSPHATIDYLINOSITOL-CALCIUM SECOND MESSENGER SYSTEM. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein.

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AF069555; AAC23863.1; -.
DR HSSP; P34996; 1DDD.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; FALSE_NEG.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.



KW 



FT DOMAIN 1 22 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 23 43 1 (POTENTIAL).
FT DOMAIN 44 57 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 58 78 2 (POTENTIAL).
FT DOMAIN 79 96 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 97 117 3 (POTENTIAL).
FT DOMAIN 118 139 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 140 160 4 (POTENTIAL).
FT DOMAIN 161 189 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 190 210 5 (POTENTIAL).
FT DOMAIN 211 231 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 232 252 6 (POTENTIAL).
FT DOMAIN 253 275 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 276 298 7 (POTENTIAL).
FT DOMAIN 299 323 EXTRACELLULAR (POTENTIAL).
FT CARBOHYD 5 5 N-LINKED (GLCNAC...) (POTENTIAL).
FT DISULFID 94 172 BY SIMILARITY.
SQ SEQUENCE 328 AA; 375[illegible] MW; B74D49B99C7164A5 CRC64;



Query Match 22.5%; Score 371.5; DB 1; Length 328; 

Best Local Similarity 29.4%; Pred. No. 1.3e-17; 



Matches 87; Conservative 61; Mismatches 137; Indels 11; Gaps 4 

Qy 22 LSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSYA- 80

I |:: I: II I |: | : : I : I I : : I : : | : | | : | | s | 

Db 24 LPLWSWFLLGLPLNAWIGQIWLARKALTRTTIYMLNLATADLLYVCSLPLLIYNYTQ 83 

Qy 81 NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFRE-HFLQKKEFAILIS 139

I : I I I I : : I I I : I I I I I I I I : I I : : : | I : | : | : 

Db 84 KDYWPFGDFTCKFVRFQFYTNLHGSILFLTCISVQRYMGICHPLASWHKKKGKKLTWLVC 143 

Qy 140 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199

111:1 : I I I :: : : I | : I : I I : I I I : I : : 

Db 144 AAVWFIVIAQCLPTFVFASTGTQRNRTVCYDLSPPDRSASYFPYGITLTITGFLLPFAAI 203 

Qy 200 CFFYYKMVVFLKRRSQQQATAL--PLDKPQRLVVLAVVIFSILFTPYHIMRNLRIASRLD 257
II I : h II I:::: I : : I I I I | : | : : : : | 

Db 204 LACYCSMARILCQKDELIGLAVHKKKDKAVRMIIIWIVFSISFFPFHLTKTIYLIVRSS 263 

Qy 258 SWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE-------MLISKFR 306



' " ' * i i i i • i i : : i i : : : | | : [ | : | 

Db 264 PTLPCPTLQAFAIAYKCTRPFASMNSVLDPILFYFTQRKFRESTRYLLDKMSSKWR 319 

RESULT 14 
P2Y2_HUMAN



ID P2Y2_HUMAN STANDARD; PRT; 377 AA. 

AC P41231; Q96EM8; 

DT 01-FEB-1995 (Rel. 31, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE P2Y purinoceptor 2 (P2Y2) (P2U purinoceptor 1) (P2U1) (ATP receptor) 

DE (Purinergic receptor) . 

GN P2RY2 OR P2RU1. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Airway epithelium; 

RX MEDLINE=94211846; PubMed=8159738;

RA Parr C.E., Sullivan D.M., Paradiso A.M., Lazarowski E.R., Burch L.H., 

RA Olsen J.C., Erb L., Weisman G.A., Boucher R.C., Turner J.T.; 

RT "Cloning and expression of a human P2U nucleotide receptor, a target 

RT for cystic fibrosis pharmacotherapy."; 

RL Proc. Natl. Acad. Sci. U.S.A. 91:3275-3279(1994). 

RN [2] 

RP REVISIONS. 

RX MEDLINE=95108098; PubMed=7809171;

RA Parr C.E., Sullivan D.M., Paradiso A.M., Lazarowski E.R., Burch L.H., 

RA Olsen J.C., Erb L., Weisman G.A., Boucher R.C., Turner J.T.; 

RT "Cloning and expression of a human P2U nucleotide receptor, a target 

RT for cystic fibrosis pharmacotherapy.";

RL Proc. Natl. Acad. Sci. U.S.A. 91:13067-13067(1994). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Placenta; 



RA Puhl H.L. III, Ikeda S.R., Aronstam R.S.;

RT "cDNA clones of human proteins involved in signal transduction

RT sequenced by the Guthrie cDNA resource center (www.cdna.org).";

RL Submitted (JUL-2002) to the EMBL/GenBank/DDBJ databases.
RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Kidney, and Leukocyte; 

RX MEDLINE=22388257; PubMed=12477932;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- FUNCTION: Receptor for ATP and UTP coupled to G-proteins that 
CC activate a phosphatidylinositol-calcium second messenger system. 

CC The affinity range is UTP = ATP > ATP-gamma-S >> 2-methylthio-ATP

CC = ADP. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: Spleen, testis, kidney, liver, lung, heart and 
CC brain. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; U07225; AAC04923.1; -.
DR EMBL; AY136753; AAN01279.1; -.
DR EMBL; BC012104; AAH12104.1; -.
DR EMBL; BC028135; AAH28135.1; -.
DR HSSP; P34996; 1DDD.
DR Genew; HGNC:8541; P2RY2.
DR MIM; 600041; -.
DR GO; GO:0005887; C:integral to plasma membrane; TAS.
DR GO; GO:0004872; F:receptor activity; TAS.
DR GO; GO:0006873; P:cell ion homeostasis; TAS.
DR GO; GO:0007200; P:G-protein signaling, coupled to IP3 second...; TAS.
DR InterPro; IPR000276; GPCR_Rhodpsn.



DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotein.
FT DOMAIN 1 32 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 33 59 1 (POTENTIAL).
FT DOMAIN 60 70 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 71 93 2 (POTENTIAL).
FT DOMAIN 94 110 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 111 129 3 (POTENTIAL).
FT DOMAIN 130 152 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 153 172 4 (POTENTIAL).
FT DOMAIN 173 194 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 195 220 5 (POTENTIAL).
FT DOMAIN 221 246 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 247 269 6 (POTENTIAL).
FT DOMAIN 270 287 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 288 309 7 (POTENTIAL).
FT DOMAIN 310 377 CYTOPLASMIC (POTENTIAL).
FT CARBOHYD 9 9 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 13 13 N-LINKED (GLCNAC...) (POTENTIAL).
FT DISULFID 106 183 BY SIMILARITY.
FT CONFLICT 312 312 R -> S (IN REF. 4; AAH12104).
FT CONFLICT 350 350 E -> G (IN REF. 1).
FT CONFLICT 359 359 S -> F (IN REF. 1).
SQ SEQUENCE 377 AA; 42289 MW; EE557A857A269AC6 CRC64;



Query Match 22.0%; Score 363.5; DB 1; Length 377; 

Best Local Similarity 33.5%; Pred. No. 4.8e-17; 
Matches 94; Conservative 52; Mismatches 128; Indels 7; Gaps 5;

Qy 19 KY-YLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIK 77

HI I : : II I :: :| :| ||:| |:|:|::|| : :||:|:

Db 32 KYVLLPVSYGWCVLGLCLNAVALYIFLCRLKTWNASTTYMFHLAVSDALYAASLPLLVY 91

Qy 78 SYA-NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAI 136

II I : IN I : : : I M I I M I I I M : I I : II : : |

Db 92 YYARGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCISVHRCLGVLRPLRSLRWGRARYAR 151

Qy 137 LISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPL 196

:: ill II |:| |: : : I I :: : || : | | :|

Db 152 RVAGAVWVLVLACQAPVLYFVTTSARGGRVTCHDTSAPELFSRFVAYSSVMLGLLFAVPF 211

Qy 197 SVMCFFYYKMVVFLKRRSQQQATALPLDKPQRLVVLAVV--IFSILFTPYHIMRNLRIAS 254

: I : I I I : : = M I : : : I I I : I : : I I : | : | | :

Db 212 AVILVCYVLMARRLLKPAYGTSGGLPRAKRKSVRTIAVVLAVFALCFLPFHVTRTLYYSF 271

Qy 255 RLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMG 294

I I M II I : I I I I I I I : : | : | | | |

Db 272 R--SLDLSCHTLNAINMAYKVTRPLASANSCLDPVLYFLAG 310
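
Feature-table (FT) rows such as those in the P2Y2_HUMAN record above follow a fixed pattern: key, start, end, description. A useful sanity check on the seven-transmembrane topology is that the DOMAIN and TRANSMEM segments tile the sequence without holes. The following is a minimal Python sketch under stated assumptions: the function names and the whitespace-separated layout are illustrative choices for this listing, not part of any SWISS-PROT tooling.

```python
import re

# Assumed layout: "FT KEY START END DESCRIPTION", whitespace separated.
FT_LINE = re.compile(r"^FT\s+(\S+)\s+(\d+)\s+(\d+)\s*(.*)$")


def parse_features(lines):
    """Return (key, start, end, description) tuples for well-formed FT rows."""
    features = []
    for line in lines:
        m = FT_LINE.match(line)
        if m:
            key, start, end, desc = m.groups()
            features.append((key, int(start), int(end), desc.strip()))
    return features


def topology_gaps(features, length):
    """List (expected_start, found_start) pairs where the DOMAIN/TRANSMEM
    segments fail to tile positions 1..length contiguously."""
    segments = sorted((s, e) for k, s, e, _ in features
                      if k in ("DOMAIN", "TRANSMEM"))
    problems = []
    expected = 1
    for start, end in segments:
        if start != expected:
            problems.append((expected, start))
        expected = end + 1
    if expected != length + 1:
        problems.append((expected, length + 1))
    return problems


# First three topology rows of the P2Y2_HUMAN record, checked over 1..70 only.
SAMPLE = [
    "FT DOMAIN 1 32 EXTRACELLULAR (POTENTIAL).",
    "FT TRANSMEM 33 59 1 (POTENTIAL).",
    "FT DOMAIN 60 70 CYTOPLASMIC (POTENTIAL).",
]
print(topology_gaps(parse_features(SAMPLE), 70))
```

An empty list means the segments are contiguous; a missing or misread coordinate shows up as a pair giving where the next segment should have started and where it actually starts.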



RESULT 15 
P2Y4_RAT 

ID P2Y4_RAT STANDARD; PRT; 361 AA. 

AC O35811;
DT 28-FEB-2003 (Rel. 41, Created)
DT 28-FEB-2003 (Rel. 41, Last sequence update)
DT 28-FEB-2003 (Rel. 41, Last annotation update)
DE P2Y purinoceptor 4 (P2Y4).
GN P2RY4 OR P2Y4.
OS Rattus norvegicus (Rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.
OX NCBI_TaxID=10116;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=Sprague-Dawley; TISSUE=Liver;
RA Bogdanov Y.D., Wildman S., King B.F., Burnstock G.;
RL Submitted (AUG-1997) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=Sprague-Dawley; TISSUE=Brain;
RX MEDLINE=98421785; PubMed=9751165;
RA Webb T.E., Henderson D., Roberts J.A., Barnard E.A.;
RT "Molecular cloning and characterization of the rat P2Y4 receptor.";
RL J. Neurochem. 71:1424-1434(1998).
CC -!- FUNCTION: Receptor for ATP and UTP coupled to G-proteins that
CC activate a phosphatidylinositol-calcium second messenger system.
CC Not activated by ADP or UDP.
CC -!- SUBCELLULAR LOCATION: Integral membrane protein.
CC -!- TISSUE SPECIFICITY: Widely expressed at low levels. In brain,
CC higher expression in the pineal gland and ventricular system.
CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; Y14705; CAA75007.1; -.
DR EMBL; Y11433; CAA72241.1; -.
DR HSSP; P34996; 1DDD.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotein.
FT TRANSMEM 31 58 1 (POTENTIAL).
FT DOMAIN 59 68 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 69 91 2 (POTENTIAL).
FT DOMAIN 92 108 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 109 127 3 (POTENTIAL).
FT DOMAIN 128 149 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 150 170 4 (POTENTIAL).
FT DOMAIN 171 192 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 193 218 5 (POTENTIAL).
FT DOMAIN 219 242 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 243 265 6 (POTENTIAL).
FT DOMAIN 266 283 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 284 305 7 (POTENTIAL).
FT DOMAIN 306 361 CYTOPLASMIC (POTENTIAL).
FT DISULFID 104 181 BY SIMILARITY.
FT CARBOHYD 175 175 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 361 AA; 40893 MW; 0377F96E54B449A3 CRC64;

Query Match 21.5%; Score 354; DB 1; Length 361;

Best Local Similarity 30.6%; Pred. No. 1.9e-16;

Matches 86; Conservative 60; Mismatches 125; Indels 10; Gaps 5; 

Qy 26 YAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY-ANDKG 84
I I : I : I I I : : : I I : : | : : : | : | : | : : | | : : : | | | : | | : 
Db 38 YAVVFVLGLALNAPTLWLFLFRLRPWDATATYMFHLALSDTLWLSLPTLVYYYAARNHW 97

Qy 85 TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILISLAVWA 144

: I II I : : : I I I I : I I I I I I : I I I : : I | : I I I : | | | 

Db 98 PFGTGLCKFVRFLFYWNLYCSVLFLTCISVHRYLGICHPLFIAIRWGRPRFASLLCLGVWL 157 

Qy 145 LVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVMCFFYY 204

; l ::| I I: : I I :| : :| : :| | : | : | 

Db 158 WAGCLVPNLFFVTTNANGTTILCHDTTLPEEFDHYVYFSSAVMVLLFGLPFLITLVCYG 217 

Qy 205 KMVVFLKR RSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMRNLRIASRLDSWPQ 261

I II : I :: I : I : : : : I : : | | : | | | : :|| 
Db 218 LMARRL YRP L P GAGQ S S S RL RS LRT I AWLTVFAVC EVP FH I T RT I Y YQARL LQA 272

Qy 262 GC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 301

I : :l MINI II ::|: I || || | 

Db 273 DCHVLNIVNWYKVTRPLASANSCLDPVLYLFTGDKYRNQL 313



Search completed: August 23, 2004, 17:00:49 
Job time : 27 secs
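
Each hit in this report carries a one-line summary of the form "Query Match ...; Score ...; DB ...; Length ...;". To compare hits side by side, those lines can be pulled into structured records. The sketch below is a hedged illustration in Python: the field names and the regular expression are assumptions based only on the summary lines as they appear in this listing.

```python
import re

# Pattern assumed from summary lines such as:
#   Query Match 21.5%; Score 354; DB 1; Length 361;
SUMMARY = re.compile(
    r"Query Match\s+([\d.]+)%;\s+Score\s+([\d.]+);\s+DB\s+(\d+);\s+Length\s+(\d+);"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    return {
        "query_match_pct": float(m.group(1)),
        "score": float(m.group(2)),
        "db": int(m.group(3)),
        "length": int(m.group(4)),
    }


print(parse_summary("Query Match 21.5%; Score 354; DB 1; Length 361;"))
```

Lines that are not hit summaries, such as the job-time line, simply return None, so the function can be mapped over the whole report and the None values filtered out.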



